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l!  SUMMARY 


This  three-year  project  was  first  initiated  under  the  title  “Alternative  Measures  of  Yield,  Testing 
and  Implementation  of  Yield  Measures,  and  the  Mantle  Attenuation  Operator.”  During  the  first 
year  of  funding  under  this  title,  work  was  completed  on  the  testing  of  alternative  measures  of  body 
wave  magnitude.  The  purpose  of  this  study  was  to  determine  whether  magnitude  measures  based  on 
integrated  coda  energy  exhibited  any  less  scatter  over  local  and  teleseismic  networks  than  classical 
magnitude  measures  based  on  the  peak  amplitude  within  several  seconds  of  P  onset  time.  The 
results  of  that  study  were  published  (Bullitt  and  Cormier, 1984)  and  reported  in  the  first  year  of  the 
project.  In  summary,  it  was  found  that  alternative  measures  of  body  wave  magnitude  generally  often 
exhibit  as  much  scatter  as  classical  measures  of  magnitude,  although  coda  magnitudes  usually  have 
slightly  less  scatter  than  spectral  magnitudes  and  classical  magnitudes.  In  the  cases  investigated, 
these  differences  were  usually  not  statistically  significant.  It  was  concluded  that  the  reason  for  the 
marginal  improvement  of  coda  magnitudes  was  due  to  the  fact  that  a  portion  of  the  coda  must  be 
focused  and  defocused  by  three-dimensional  structure  in  step  with  the  first  several  cycles  of  the  P 
wave. 


Another  completed  task  related  to  the  first  year  of  the  project  was  an  investigation  of  the 
intrinsic  attenuation  of  the  Earth’s  mantle  along  selected  paths  from  the  Sea  of  Okhotsk  to  Regional 
Seismic  Test  Network  (RSTN)  and  Global  Digital  Seismic  Network  (GDSN)  stations  in  North 
America.  The  conclusion  of  this  study  was  that  the  intrinsic  attenuation  in  the  mantle  beneath 
eastern  North  America  is  both  depth  and  frequency  dependent  and  that  spectral  and  time  domain 
studies  of  attenuation  can  be  reconciled  in  the  frequency  band  up  to  2  Hz.  The  effects  of  scattering 
in  the  Earth’s  upper  mantle  and  crust  do  not  seem  to  dominate  those  of  intrinsic  attenuation  in  this 
frequency  band.  The  results  of  that  study  were  published  as  a  refereed  paper  (Choy  and  Cormier, 
1986),  and  are  included  as  part  2  of  the  current  report. 


In  the  second  and  third  year’s  funding,  the  scope  of  the  pioject  was  expanded  to  investigate  a 
broadet  range  of  scattering,  attenuation,  and  wave  propagation  problems.  The  title  of  the  project 
changed  to  “Effects  of  Lateral  Heterogeneities  on  the  Propagation,  Scattering,  and  Attenuation 
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of  Seismic  Waves,  and  the  Characterization  of  the  Seismic  Source."  Prof.  Aki  was  added  as  co- 
investigator  along  with  Prof.  Toksoz  and  Dr.  Cormier.  The  focus  of  the  project  was  then  divided 
between  source  problems  in  source  inversion  from  recorded  waveforms  and  problems  related  to 
scattering  and  wave  propagation  in  three-dimensional,  heterogeneous  media.  The  results  of  source 
inversion  of  body  waves  were  reported  in  previous  semi-annual  reports  and  formed  the  basis  of  the 
work  that  formed  the  thesis  by  John  Nabelek.  A  significant  result  of  this  work  was  that  short 
period  and  broadband  waveforms  can  improve  the  depth  resolution  determined  earthquakes  and 
underground  nuclear  explosions,  forming  a  powerful  discriminant. 

Scattering  was  studied  theoretically  and  observationally  using  the  results  of  phase  and  amplitude 
studies  at  seismic  arrays,  as  well  as  the  shapes  of  coda  waves  following  the  S  waves  of  local  and 
regional  earthquakes.  These  studies  are  included  in  the  thesis  by  Ru-shan  Wu.  Chapters  of  that 
thesis  have  been  excerpted  in  earlier  semi-annual  reports.  The  significant  result  of  that  work  is 
that  the  Earth’s  lithosphere  must  possess  multiple  scales  of  heterogeneity  in  order  to  explain  both 
the  amplitude  and  phase  fluctuations  at  large  arrays  as  well  as  the  shapes  of  local  S  coda.  Scale 
lengths  on  the  order  of  10  km  and  less  than  1  km  are  needed  to  explain  both  data  sets.  Another 
important  result  of  this  work  was  that  a  method  was  proposed  for  separating  intrinsic  attenuation 
and  scattering  attenuation  based  on  the  shape  of  the  coda  following  the  S  waves  of  local  and  regional 
events.  A  data  set  of  deep  and  shallow  focus  events  in  the  Hindu  Kush  region  of  Asia  was  studied 
using  this  method.  Scattering  attenuation  was  found  to  dominate  intrinsic  attenuation  only  in  the 
highest  frequency  band  above  10  Hz. 

In  the  final  area  of  research,  an  asymptotically  approximate  method  of  synthesizing  seismograms 
in  three-dimensionally  varying  media  was  tested  and  implemented.  This  formed  the  basis  of  a  thesis 
by  Robert  Nowack.  This  work  is  the  only  remaining  task  that  has  not  been  included  in  previous 
reports,  and  as  such,  is  included  as  part  1  of  this  report.  The  results  of  this  study  will  be  important 
for  predicting  the  amplitude  fluctuations  induced  by  larger  scale  structures  (10  km  and  greater 
scale  lengths)  in  both  source  and  receiver  regions.  Such  structures  can  be  responsible  for  amplitude 
fluctuations  larger  than  that  induced  by  regional  variations  in  intrinsic  attenuation  and  are  likely 
to  be  responsible  for  much  of  fluctuations  observed  in  measures  of  magnitude  at  teleseismic  range. 
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2.  WAVE  PROPAGATION  IN  LATERALLY  VARYING  MEDIA  AND  ITERATIVE 
INVERSION  FOR  VELOCITY 

2. 1  INTRODUCTION 

In  recent  years,  a  variety  of  techniques  have  been  developed  to 
compute  seismograms  in  laterally  varying  media.  These  include  the  finite 
difference  and  finite  element  methods  which  are  general  techniques  but 
expensive.  Thus  they  are  limited  to  small  grids  and  lower  frequencies  (see 
for  example  Boore  [1972],  Smith  [1975],  Kelly  et.al.  [1976]).  Perturbation 
techniques  are  also  useful  primarily  at  lower  frequencies  (see  Kennett 
[1972a,  1972b],  Larner  [1970],  Aki  and  Richards  [i960]).  The  boundary 
integral  method  is  another  technique  recently  investigated  in  the 
seismological  literature  (see  Sanchez-Sesma  and  Esquivel  [1979,, 
Sanchez-Sesma  et.al.  [1982]).  The  remaining  methods  to  be  described  are 
primarily  high  frequency  asymptotic  methods. 

The  ray  method  is  one  of  the  most  commonly  used  techniques  for 
propagation  in  laterally  varying  media.  It  is  based  on  a  series  solution  to 
the  wave  equation  (elastodynamic  equation)  in  inverse  powers  of  frequency 
(Kline  and  Kay  [1965],  Cerveny  et.al.  [1977])  In  practice,  usually  only  the 
first  term  is  retained.  Examples  of  seismograms  computed  using  the  ray 
method  are  found  in  May  and  Hron  [1978],  Cerveny  [1979,,  McMechan  and 
Mooney  [1980]. 

Simple  estimates  of  the  validity  of  the  ray  method  are  based  on  several 
cr.terion  (Kravtsov  and  Orlov  [1980]).  First,  the  parameters  of  the  medium 
as  well  as  the  amplitude  and  phase  gradient  of  the  wavefront  should  be 
slowly  varying  over  the  cross  section  of  the  Fresnel  volume.  The  boundary 


of  the  Fresnel  volume  is  defined  by  u rg  -urv  =  tt  ,  where  rg  is  the  phase 
time  for  the  geometrical  ray  and  rv  Is  the  phase  time  for  a  secondary 
virtual  ray.  The  Fresnel  volume  is  the  physical  region  around  a  geometric 
ray  for  finite  frequencies.  Next,  the  phase  difference  corresponding  to  two 
rays  arriving  at  the  same  point  should  be  greater  than  tt  or  a  caustic 
results.  Finally,  the  ray  method  has  the  range  limitation 


where  L  is  the  range,  \  is  the  wavelength,  and  a  is  the  inhomogeneity 
scale.  This  limitation  is  a  result  of  the  increasing  size  of  the  Fresnel  radius 
with  distance. 

A  disadvantage  of  the  ray  method  even  at  high  frequencies  is  that  the 
method  is  invalid  at  caustics  where  it  incorrectly  predicts  infinite 
amplitudes.  These  singularities  can  be  smoothed  over  by  properly 
incorporating  the  effects  of  adjacent  rays  into  the  solution.  One  way  to 
correct  the  ray  method  is  to  construct  a  local  asymptotic  solution  in  the 
vicinity  of  the  caustic  (see  Ludwig  [  L 966]).  This  method  may  be  used  to 
construct  a  uniform  solution,  but  may  be  inconvenient  for  complicated 
structures. 

Certain  diffraction  effects  can  be  included  in  the  ray  method  using  an 
extension  known  as  the  geometrical  theory  of  diffraction  (Keller  [1962'' 
Keller  defined  new  diffracted  ray  types  which  include  edge  rays  and  vertex 


rays.  A  simple  review  of  this  theory  is  given  by  Keller  [1978].  Extensions  to 
seismology  are  given  for  example  by  Klem-Musatov  and  Aizenberg  [1984]. 
In  this  approach,  diffraction  coefficients  must  be  determined  for  different 
cases. 

A  simplification  of  the  ray  method  applied  to  vector  wave  equations, 
such  as  the  elastodynamic  equation,  was  the  introduction  of  the  ray 
centered  coordinates  by  Popov  and  Psencik  [1976,1978].  This  is  a  special 
orthogonal  curvilinear  coordinate  system  which  moves  along  a  particular 
ray.  In  this  coordinate  system,  the  first  order  transport  equations  for  the 
P  and  S  displacement  components  decouple.  This  results  in  three  scalar 
transport  equations  rather  than  vector  equations.  Writing  the  wave 
equation  in  this  local  coordinate  system  along  a  given  ray  is  also  a 
fundamental  step  in  the  Gaussian  beam  method. 

The  Maslov  method  is  a  semi-classical  method  which  can  be  applied  to 
propagation  problems  to  extend  the  ray  method  (see  Kravtsov  [  L968j. 
Chapman  and  Drummond  [  1983],  and  Ziolkowski  and  Deschamps  [  L  984.') 
The  method  is  based  on  the  fact  that  a  ray  is  a  construct  dependent  not 
just  on  location,  £  ,  but  also  on  the  direction  represented  by  the  slowness 
vector,  p  .  In  the  space  (f  p)  there  are  no  caustics.  This  can  be  seen  by 
noting  that  is  a  state  vector  for  the  ray  system.  Thus  if  two 

rays  have  the  same  state  vector  for  a  given  time,  then  they  will  the  same 
for  all  time  since  this  vector  contains  all  the  information  needed  to 


construct  the  ray.  Caustics  occur  upon  projection  from- this  extended 
space  to  various  three  dimensional  configuration  spaces.  The  caustics  will 
have  different  locations  depending  on  the  configuration  space.  In  the 
(i ,y  ,z)  space,  the  method  reduces  to  the  standard  ray  method.  In  the 
( px,y,z )  space  in  a  vertically  varying  medium,  the  method  would  be 
similar  to  the  WKBJ  method  discussed  by  Chapman  [1978]  where  the  field  is 
the  result  of  an  integral  over  horizontal  slowness,  px  ,  which  is  then 
evaluated  by  an  equal  phase  method.  The  idea  of  the  Maslov  method  is  to 
choose  the  domain  of  computation  which  avoids  caustics  near  different 
receivers.  A  problem  with  this  method  is  that  the  appropriate  domain  of 
computation  is  not  known  prior  to  starting  the  computations.  The  final 
solution  is  the  result  of  piecing  together  the  asymptotic  solutions  which 
are  found  to  be  valid  in  different  ranges. 

Another  technique  for  the  propagation  in  laterally  varying  media  is 
based  on  the  parabolic  approximation  to  the  wave  equation.  It  was  first 
derived  by  Leontovich  and  Fock  in  1946  for  radio  wave  propagation  (see 
Fock  [1965],  ch  11).  The  parabolic  approximation  assumes  small  angle 
forward  scattering  about  some  preferred  direction  of  propagation.  It 
results  in  a  one  way  operator,  since  back  travelling  energy  is  neglected.  An 
advantage  of  the  derived  parabolic  equation  is  that  a  boundary  value 
problem  is  replaced  by  a  much  simpler  initial  value  problem  The  method 
is  valid  assuming  that  the  fluctuations  of  the  material  parameters  are 
small,  and  slowly  varying  with  respect  to  the  wavelength. 
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In  relation  to  the  ray  method,  the  parabolic  method  does  not  require 
energy  to  remain  bounded  within  rigid  ray  tubes.  It  accounts  for  the  slow 
lateral  spread  of  ray  energy  along  the  wavefront  as  the  wave  propagates. 
The  parabolic  method  thus  includes  the  effects  of  diffraction  due  to  lateral 
diffusion  along  the  wavefront  (see  review  by  Malyuzhinets  [1959]).  This 
explains  the  similarity  in  form  between  the  parabolic  equation  and  the 
diffusion  equation.  The  parabolic  equation  is  also  closely  related  to  the 
Fresnel  approximation  in  optics  (see  Goodman  [1968],  Tappert  [1976]). 

Geophysical  applications  of  the  parabolic  method  include  long  range 
underwater  acoustics  where  the  global  direction  of  propagation  is  assumed 
to  be  horizontal  (Tappert  [1976],  McDaniel  [1976]),  and  reflection 
seismology  where  the  global  direction  is  assumed  to  be  vertical  for  upward 
travelling  reflected  waves  (Claerbout  [1976]).  Applications  to  elastic  waves 
are  discussed  by  Landers  and  Claerbout  [1972],  McCoy  [1977],  Hudson 
[1980a,  1980b],  and  Wales  and  McCoy  [  1983]. 

Derivations  of  one  way  parabolic  operators  by  the  use  of  splitting 
matrices  can  be  found  in  Corones  [1975]  and  McDaniel  [1976].  The 
importance  in  this  approach  is  the  potential  estimation  of  coupling 
between  forward  and  back  travelling  energy.  The  splitting  method  has 
been  extended  to  the  elastic  case  by  Corones  et.al.  [1982]. 

Another  method  for  including  non-Fermat  information  is  by  using  the 
Kirchhoff  integral  in  which  the  field  is  gotten  by  an  integration  along  some 


Intermediate  wave  surface  (see  Baker  and  Copson  [1950]).  For  elastic 
formulations  see  Pao  and  Varatharajulu  [1976]  and  Aki  and  Richards 
[i960].  For  geophysical  applications  see  Trorey  [1970,1977],  Hilterman 
[1970,1975],  Scott  and  Helmberger  [1983],  Hadden  and  Buchen  [1981],  and 
Frazer  and  Sinton  [  1984].  Evaluation  along  many  intermediate  surfaces 
leads  in  the  limit  to  a  path  integral  formulation  (Dashen  [1979],  Flatte 
et.al.  [1979]). 

In  this  thesis,  the  Gaussian  beam  method  is  investigated.  It  is  an 
extension  of  both  the  ray  method  and  the  parabolic  method  in  which  a 
global  direction  must  be  assumed.  The  Gaussian  beam  method  has  been 
described  in  the  literature  by  Popov  [1981,1982],  and  Cerveny  et.al.  [1982]. 
These  studies  were  based  on  the  scalar  wave  equation.  The  elastic  case  was 
first  derived  by  Kirpicnnichova  [1972]  and  more  recently  by  Cerveny  [1983] 
for  the  2-D  case  and  Cerveny  and  Pseacik  [1983]  for  the  3-D  case. 

The  Gaussian  beam  method  involves  three  basic  steps.  First,  a  system 
of  rays  must  be  traced  from  the  source  to  a  vicinity  of  the  receiver.  Since 
high  frequency  energy  flows  along  rays,  the  system  of  rays  can  be  thought 
as  the  framework  on  which  the  wavefield  is  to  be  built.  This  step  requires 
the  numerical  solution  of  the  kinematic  ray  tracing  system  Next,  the  wave 
equation  is  solved  in  the  ray  centered'  coordinates  for  each  ray  using  the 
•parabolic  approximation  This  approximation  is  a  local  one  related  to  a 
particular  ray  trajectory-  These  local  solutions  can  be  expressed  in  terms 


of  Gaussian  beams.  The  Anal  solution  is  the  result  of  superposing  local 
solutions  along  individual  rays  to  give  an  approximate  global  sc'ution  for  a 
given  source  condition.  This  step  relies  on  the  linearity  of  the  underlying 
wave  equation. 

There  are  several  advantages  in  using  the  Gaussian  beam  method.  In 
relation  to  the  ray  method,  the  GB  method  is  finite  at  caustics.  In  addition, 
no  prior  location  of  the  caustic  locations  is  required  as  in  the  Maslov 
method.  The  GB  method  relies  on  local  parabolic  approximations  and 
therefore  no  global  direction  of  propagation  is  assumed  as  in  the  standard 
parabolic  method.  Finally,  the  GB  method  is  fast,  comparable  to  the 
standard  ray  method,  and  possibly  faster  since  no  two  point  ray  tracing  is 
reguired. 

This  thesis  is  organized  as  follows.  In  section  2.2  the  Gaussian  beam 
method  is  applied  to  2-D  velocity  structures.  The  validity  of  the  method  is 
tested  in  two  ways.  First  is  the  application  of  the  reciprocal  theorem  for 
Green’s  functions  in  an  arbitrary  heterogeneous  medium.  The  discrepencv 
between  synthetic  seismog-ams  for  reciprocal  cases  is  considered  as  a 
measure  of  error.  Second,  the  Gaussian  beam  method  is  applied  to  cases 
for  which  solutions  are  known  by  other  approximate  methods.  This 
includes  the  soft  basin  problem.  The  method  is  then  used  for  two 
applications.  First,  it  is  used  to  study  volcanic  earthquakes  at  \tt.  St. 
Helens.  The  observed  large  differences  in  amplitude  and  arrival  time 
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between  a  station  inside  the  crater  and  those  at  the  flank  can  be  explained 
by  the  combined  effects  of  an  anomalous  velocity  structure  and  shallow 
focal  depths.  The  method  is  then  applied  to  scattering  of  teleseismic  P- 
waves  by  a  lithospere  with  randomly  fluctuating  velocities. 

Section  2.3  discusses  several  topics  in  Gaussian  beam  synthesis. 

This  includes  the  choice  of  the  beam  parameter,  c  ,  and  its  interpretation 
in  terms  of  a  complex  source  point.  Asymptotic  and  exact  evaluations  of 
the  Gaussian  beam  integral  for  a  line  source  are  then  investigated.  Next, 
the  problem  connected  with  an  initial  point  in  a  gradient  is  discussed, 
which  in  a  vertically  varying  media  reduces  to  the  difference  between  an 
initial  plane  wave  and  an  initial  Snell  wave.  A  comparison  is  done  with  the 
reflectivity  method  for  the  PP  phase  in  the  Imperial  Valley  structure. 
Then,  a  comparison  is  then  done  between  the  slant  stack  of  a  wavefield  and 
the  Gaussian  beam  decomposition  using  large  beam  widths,  showing  good 
agreement.  Beams  reflected  at  an  interface  are  then  investigated  using  a 
first  order  correction  to  the  plane  wave  reflection  coefficient  typically  used 
in  the  Gaussian  beam  method.  Finally,  the  finite  frequency  scattering 
effects  from  an  indented  interface  are  computed  using  the  Gaussian  beam 
method  and  compared  with  those  computed  using  the  Aki-Larner  method. 

In  the  final  section,  inversion  for  material  parameters  using  waveform 
data  is  investigated.  An  iterative  linearized  approach  is  used  in  which  a 
linear  sensitivity  operator  must  be  derived  (see  Tarantola  [1904;).  This 


can  be  done  economically  by  using  reciprocity  of  the  Green's  function.  In 
order  to  avoid  a  large  matrix  inversion,  several  descent  algorithms  are 
described.  Data  errors  and  a  priori  model  information  are  incorporated 
using  covariance  operators.  A  fast  and  reasonably  accurate  forward 
modelling  scheme  is  required,  and  here  we  make  use  of  the  Gaussian  beam 
method  in  a  slowly  varying  heterogeneous  medium.  Different  linearizations 
are  possible  including  a  linearization  in  terms  of  the  field,  che  Born 
approximation,  and  a  linearization  in  terms  of  the  log  field,  the  Rytov 
approximation.  The  relative  merits  are  discussed.  Finally  several 
numerical  examples  are  performed  using  a  field  linearization  in  order  to 
test  the  method.  Transmitted  body  waves  through  a  model  with  sources 
beneath  a  heterogeneity  and  surface  stations  are  used.  The  results  using 
the  waveform  data  identify  the  trial  structures.  A  comparison  is  done  with 
the  travel  time  inversion  results. 
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2.2  THE  2-D  GAUSSIAN  BEAM  SYNTHETIC  METHOD:  TESTING  AND 
APPLICATION 

2.2.1  INTRODUCTION 


In  this  paper,  we  will  discuss  applications  of  the  Gaussian  beam 
method.  The  Gaussian  beam  method  is  a  variation  of  both  the  asymptotic 
ray  method  [Cerveny  et.  al.,  1977]  and  the  parabolic  approximation  method 
[Tappert,  1976;  Claerbout,  L976].  The  Gaussian  beam  method  has  been 
recently  described  in  the  literature  by  Babich  and  Popov  [1901],  Popov 
[1901,  19021  and  Cerveny  et.  al.  [1902].  These  studies  were  based  on  the 
scalar  wave  equation  The  elastic  case  was  first  derived  by  Kirpicnichova 
[1971]  and  more  recently  by  Cerveny  and  Psencik  [1903a]  for  the  2-D  case 
and  Cerveny  and  Psencik  [  1.903b]  for  the  3-D  case. 


There  are  three  basic  steps  involved  in  the  Gaussian  beam  method. 
First,  a  system  of  rays  must  be  traced  from  the  source.  Since  high- 
frequency  energy  flows  along  rays,  the  system  of  rays  can  be  thought  of  as 
the  framework  upon  which  the  wavefield  is  to  be  built.  This  step  requires 
the  numerical  solution  of  the  kinematic  ray  tracing  system.  Second,  the 
wave  equation  (elastodynamic  equation)  is  solved  in  "ray  centered” 
curvilinear  coordinates  for  each  ray  using  the  parabolic  approximation. 
This  is  now  a  local  parabolic  approximation  related  to  a  particular  ray 
trajectory.  These  local  solutions  can  be  expressed  in  terms  of  Gaussian 
beams.  The  find  solution  is  the  result  of  superposing  local  solutions  along 


individual  rays  to  give  an  approximate  global  solution  for  a  given  sour.e 


condition.  This 


step  is  justified  by  the  linearity  of  the  underlying  wave 


equation. 


There  are  several  advantages  to  using  the  Gaussian  beam  method.  In 
relation  to  the  asymptotic  ray  method,  the  Gaussian  beam  method  is 
always  finite  at  caustics.  In  addition,  no  prior  knowledge  of  caustic 
locations  is  required  as  in  the  Maslov  method  [Kravtsov,  1988;  Chapman 
and  Drummond,  1 983 j .  Since  the  Gaussian  beam  method  relies  on  local 
parabolic  approximations,  no  global  direction  of  propagation  must  be 
assumed  as  in  the  standard  parabolic  approximation.  Finally,  the  Gaussian 
beam  method  is  comparable  in  cost  to  ray  methods,  and  possibly  even 
faster  since  no  two-point  ray  tracing  is  done. 

The  organization  of  this  paper  is  as  follow;  First,  a  more  detailed 
description  of  the  Gaussian  beam  method  is  given  along  with 
computational  procedures.  Then,  various  examples  are  given  in  order  to 
illustrate  different  uses  of  the  method  as  well  as  to  test  its  validity.  Two 
procedures  are  used  to  test  the  method.  First,  the  reciprocal  theorem 
valid  for  an  arbitrary  heterogeneous  medium,  is  applied.  The  discrepancy 
between  reciprocal  seismograms  is  considered  as  a  measure  of  the  error  in 
the  Gaussian  beam  solution.  Second,  the  Gaussian  beam  synthesis  is 
applied  to  several  cases,  including  the  2-D  soft  basin  problem,  for  which 
solutions  are  known  by  other  approximate  methods  The  Gaussian  beam 
method  is  then  used  for  two  applications.  Fust,  the  method  is  used  to 
study  volcanic  earthquakes  recorded  under  Mt.  51.  Helens.  Then,  the 
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method  ia  applied  to  scattering  of  teleseismic  P  waves  by  a  lithosphere  with 
randomly  fluctuating  velocities. 


2.2.2  REVIEW  OF  THE  GAUSSIAN  BEAM  METHOD 


In  this  section,  a  more  detailed  description  of  the  Gaussian  beam 
method  is  given.  The  discussion  is  limited  to  the  2-D  case,  for  which  the 
examples  are  given,  and  follows  the  results  of  Cerveny  et.  al.  [1982], 
Cerveny  and  Psencik  [1983a],  and  Cerveny  [IS83]. 

Before  solving  for  the  rays  and  the  beam  solutions,  the  velocity  model 
must  be  parameterized.  The  velocities  are  input  as  discrete  points,  and 
then  interpolated  using  cubic  splines.  This  results  in  continuous  velocities 
as  well  as  first  and  second  derivatives.  For  velocities  given  on  a  mesh  in  2- 
D  or  3-D,  this  involves  a  tensor  product  of  i-D  splines  [De  Boor,  1980]. 
Velocity  discontinuities  are  included  by  introducing  interfaces  between 
layers  with  smoothly  varying  velocities.  Curved  .nterfaces  are  input  as  a 
discrete  set  of  points  and  then  spline  interpolated. 

Spline  interpolation  of  the  velocity  may  introduce  some  oscillation  in 
the  interpolated  function  depending  on  the  input  grid  values  [Azbel  et.  al.. 
1980]  Care  must  be  taken  to  check  the  interpolated  function  and  to  have 
the  grid  velocities  change  slowly.  Several  methods  can  be  used  to  avoid  the 
oscillations  in  the  interpolating  function.  The  first  is  to  use  smoothed 
splines  as  described  by  Fretlova  [l9'7bj.  A  second  approach  is  to  use 
splines  under  tension  [see  Cline,  1981].  For  the  remainder  of  this  paper, 
careful  use  is  made  of  simple  bicubic  spline  interpolation  by  comparing  the 
desired  and  interpolated  functions. 
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The  first  step  of  the  Gaussian  beam  method  is  to  solve  the  kinematic 
ray  tracing  system  The  rays  are  extremal  of  rermats  integral  and  are 
determined  by  solving  the  ray  equations: 


dpi _ -3  dv 

-  SVPi, - -  - - 


ds 


i  =  ll2 

ds  dXi 


(1) 


with  initial  conditions; 


2(So)=*o,  PM=Po 

where  £  is  the  location  along  the  ray  and  p  is  the  slowness  vector  tangent 
to  the  ray.  In  an  elastic  medium,  the  velocity,  v  ,  is  either  a  or  jS  .  This 
system  can  be  solved  by  standard  numerical  techniques  such  as  the 
Runge-Kutta  method.  At  an  interface,  Snell’s  law  is  applied  locally.  These 
ray  equations  are  appropriate  for  an  isotropic  nor.  iispersive  media.  In  an 
isotropic  dispersive  media,  wave  packets  will  travel  at  the  group  velocity, 
but  their  trajectories  will  be  determined  by  the  local  phase  velocity. 


The  second  step  of  the  Gaussian  beam  method  involves  solving  the 
wave  equation  locally  in  ray-centered  coordinates  using  the  paraboiic 
approximation.  The  ray-centered  coordinate  system  is  an  orthogonal 
curvilinear  system  that  follows  along  a  particular  ray  and  was  introduced 
to  seismology  by  Popov  and  Psencik  [1976,  1978].  tn  a  2-D  medium  the 
ray-centered  coord. nates  can  be  specified  by  the  unit  vector  t  tangent  to 
the  ray  and  the  unit  vector  n  normal  to  the  ray  (see  Figure  l).  An  element 
of  infinitesimal  length  .u  the  ray-centered  coordinates  can  be  written 
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Figure  1  Ray  centered  coordinate  system  in  2-D.  The  beam  solution 
concentrated  near  the  ray  and  has  a  Gaussian  shape. 


\df\z-  /i?is2+/ifdn2  (2) 

where  the  scale  factors,  h.^  and  h.%,  are  given  by 

=  L+v_1vnn  7i2  =  l 

where  v  and  v  n  are  evaluated  on  the  central  ray  with  n=0  .  The  ray- 
centered  coordinates  comprise  an  orthogonal  system.  Since  the  local  flow 
of  high-frequency  energy  is  along  rays,  the  ray-centered  coordinate  system 
is  an  appropriate  system  for  approximating  the  wave  equation  by  local 
one-way  operators. 

The  elastodynamic  equation  is  then  written-  in  orthogonal  curvilinear 
coordinates,  as  in  Aki  and  Richards  [i960,  eqn.  2.4S],  using  scale  factors 
for  the  ray-centered  system.  The  basic  step  in  delving  a  one-way  operator 
from  the  elastodynamic  equation  is  the  substitution 

U;(s  .n.u.t')  =  (3) 

where,  r(s)  =  y*v(s)“lds  is  the  phase  delay  along  the  central  ray,  v(s)=a  or 
v(s)=0,  and  V:  is  a  slowly  varying  envelope.  The  envelope  is  then  expanded 

as  Uj  -  £  0]u^/z .  Note  that  this  is  in  half  powers  of  «“l  as  opposed  to 

i«0 

the  typical  ray  series  expansion  in  integral  powers.  In  2-D,  U3  and  Un  are 
the  in-plane  components  along  the  ray  and  normal  to  it,  and  U 5  is  the  out 
of  plane  component  normal  to  the  ray.  After  substitution  of  equation  (3) 
into  the  elastodynamic  equation,  the  resulting  equations  are  approximated 
for  large  v.  The  following  results  are  obtained  by  retaining  only  terms  of 


order  u*  y*  1.  and  letting  i/=wl/2n  assuming  that  v=0(l).  To  this  order 
of  approximation,  the  P  and  S  components  decouple.  For  an  incident  P 
wave  [Cerveny  and  Psencik,  1983]: 

w*  -  vsdmw 

£/°  =  0  (4) 

£/b=0 

where  W*  satisfies  the  foliowing  parabolic  equation 

2io"1  W%  +  W%  -  o^a.nnU2^  =  0  (5) 


For  an  incident  SV  wave 


£/,°  =  0 

£4l  =l/3£/n°.u 


£4=0 

where  satisfies  the  parabolic  equation 


2 ir'wS  +  -/9“3/9.nnv2^  =  0 


(6) 


(7) 


and  for  an  incident  SH  wave 


L’f(s.v) 


l 

^0(s)p(s) 


fK*(s  ,y) 


(8) 


£4  =£4=0 

with  »»*  satisfying  the  same  parabolic  equation  *s  fur  tiie  SV  case.  Ail 
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velocities  and  densities  are  evaluated  along  the  central  ray.  These  results 
were  derived  from  asymptotic  analysis  of  the  elastodynamic  equation.  The 
same  basic  parabolic  equation  for  W  can  also  be  obtained  from  the  eikonal 
equation  as  shown  in  appendix  1. 


The  last  term  in  the  parabolic  equation  (5)  or  (7)  comes  from  the 
approximation  of  order  cj 


h-l-A  ,  1  '  "z  -  .A'-zl?*  -^(srVnn"* 

v{s,n)z  h.fv(sr 


W 


where  u(s)  and  uinn  are  evaluated  along  the  central  ray,  v(s,n)  is  the 
velocity  in  the  vicinity  of  the  central  ray,  and  h.l  is  the  scale  factor  for  the 
ray-centered  coordinates.  Without  this  approximation  for  the  last  terra 
the  parabolic  equation  (5)  or  (7)  is  similar  in  form  to' those  given  by  Aki 
and  Richards  [i960,  eqn.  13.154]  or  Claerbout  [1976,  10-3-9].  The 
parabolic  equation  (o)  or  (7)  is  valid  for  small  angle  forward  scattering 
about  the  central  ray  direction  assuming  large  scale  inhomogeneities  with 
respect  to  the  wavelength  In  addition,  the  validity  of  the  above 
approximation  requires  that  the  velocities  be  smoothly  varying  within 
certain  beam-width  of  the  central  ray  Since,  as  will  be  seen,  the  beam 
solutions  to  (5)  or  (7)  spread  with  distance,  this  will  result  in  a  range 
limitation  for  the  Gaussian  beam  method. 


A  particular  sciufjjbr.  g: 
written  as 


parabolic  equation  (5)  or  (7)  can  be 


junurur rir*  v>  v> w ww  w v*  iv wLvi%mv. mv rw  w lvwv  wuvv w vwi »wV* avax. t 


(9) 


with  u(s)  and  vifUl  evaluated  along  the  central  ray.  These  are  the  same 
equations  as  the  dynamic  ray  tracing  system  for  ray  theory,  except  here 
>i(s)  and  q(s)  have  to  be  complex.  j>(s)  and  g(s)  must  also  satisfy  certain 
conditions,  among  them  the  condition  Im{jp/q)> 0.  In  this  case  the 
energy  will  be  confined  to  a  vicinity  of  the  central  ray,  and  the  solution  will 
be  a  Gaussian  beam  (see  Figure  l).  For  ray  theory  p  and  q  are  real,  and 

q(s)  has  the  interpretation  of  spreading  and  has  the 

interpretation  of  wavefront  curvature.  The  equations  in  (10)  are  in 
transmission-line  form,  where  the  ratio  p/q  is  related  to  the 
characteristic  admittance  for  the  system. 


The  complete  solution  to  the  parabolic  equation  for  a  particular 
central  ray  is  made  up  of  an  infinite  system  of  linearly  independent  beam 
solutions  or  modes  [Cerveny  et.al.,  1982]  of  the  form 

_ 

HUs.tO  =  — =srr(  2— )k/zHk(u^lm(p/ q)}e  2  q  k-  0,1,2  U0 

v<7  (s )  <7 

where  are  hermita  polynomials.  Here,  only  the  zerclh  order  beam 
solution,  (9).  is  used  for  each  central  ray  and  no  mode  coupling  between 
higher  order  beam  solutions  is  assumed. 


•  The  localized  solution  to  the  2-D  elastodynamic  equation  along  a 
particular  central  ray,  specified  by  a  ray  index  parameter,  6,  can  be 
written  in  the  following  form[Cerveny  and  Psencik,  1983a] 


l. 


■7*e 


(12) 


where  for  P  waves,  u(s)=a(s),  f  =  [f  +  rla(s)2|£^-n,] ; 


for  SV  waves,  t»  (s )  =  0(s ) 


and  for  SH  waves,  v(s)  =  /J(s)  f  =  S  (out  of  plane  component)-, 

where  p  and  q  solve  (10)  and  are  in  general  complex,  and  f  gives  the 
orientation  of  the  displacement  vector  nearby  the  central  ray.  Note  that 
the  beam  solutions  are  valid  only  where  the  ray-centered  coordinates  are 
well  defined  and  regular  in  some  region  nearby  the  central  ray. 


The  beam  solutions  must  be  modified  m  the  presence  of  any  interfaces. 
Following  Popov  [1982],  it  is  assumed  that  to  first  approximation  the  beam 
solutions,  (11),  for  a  given  ray  remain  uncoupled  at  a  smoothly  curved 
interface.  Then,  the  particular  beam  solution,  (9),  is  only  modified  by  a 
possibly  complex  refi.ection  or  transmission  coefficient  and  the  change  m 


p(s 


(s',  and  q  s)  d  .e  •. o  me  interface.  Formulas  for  ho -y  real-valued  p(s) 


and  q(s)  transform  at  an  interface  are  given  by  Cerveny  and  Psencik 
[1979],  and  for  curvature  by  Cerveny  and  Hron  [l98U_.  Formulas  for  how 


complex-valued  p(s)  and  g(s)  transform  at  an  interface  are  given  by 
Cerveny  [  1983c],  where  the  equations  are  written  so  that  g(s)  is  a  relative 
spreading  across  each  interface.  This  avoids  writing  an  additional  factor  in 
the  beam  amplitude. 

For  Gaussian  beams,  p(s )  and  g(s)  can  be  written  as  [see  Cerveny  et. 
al.,  1982] 

g(s)  =cqi(s)  +q2(s)  (13) 

p(s)  =epi(s)  +pz(s) 

where  p(s)  and  q{s)  satisfy  equation  (10),  £  is  a  complex  valued 
parameter  to  be  determined,  and  ^  is  the  fundamental  matrix  of 

linearly  independent  real  solutions  of  (10)  with  initial  conditions 
0  iAo  From  these  initial  conditions 

A', (s0)  =  =  0  •*  inital  plane  wave  at  source 

Kz(sq)  =  =  «  -*  inital  Line  source 

9  2($o) 

where  vo=k(so)  ■  Thus,  the  complex  solution  is  a  linear  combination  of 
real  solutions  for  a  plane  wave  and  line  source  weighted  by  c  as  in  (13).  e 
is  a  factor  rbcsen  so  that  a)  7(s)*0;  resulting  in  no  singularities  in 

amplitude  along  the  ray.  and  b)  Im(*4~)>0;  implying  'hat  the  solution 

9  / 

is  concentrated  near  the  ray.  Following  Cerveny  et.  al.  [1982].  e  is  written 
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In  the  form 

'  =  U4) 

where,  geometrically,  La  is  the  beamrwidth  at  a  specific  location  along  the 
beam  known  as  the  beam  waist,  and  50  is  a  parameter  which  shifts  the 
location  of  the  beam  waist  away  from  the  source  location,  sQ  ,  along  the 
ray.  Cerveny  et.  al.[l982]  showed  that  conditions  a)  and  6)  above  are  both 
satisfied  when  La*0.  This  result  depends  on  the  fact  that  the  determinant 
of  the  fundamental  matrix  above  is  a  nonzero  constant  along  the  ray,  and 
therefore,  g\(s)  and  ?2(s)  cannot  simultaneously  be  zero  for  a  given  s. 
Thus,  the  method  is  constructed  to  ensure  no  singularities  in  amplitude. 


The  fundamental  beam  solution  can  now  be  written  from  (12)  as 


where 


(15) 


g(s)=[50-ir=— ^]<7i+<72  =  complex  spreading 
ZVq 

K(s)  -  v(s)Re[^s^''  -  phase  front  curvature  of  the  beam 
q{s)J 


K(s)  = 


v(s)[(5Qpl+p2)(50gl+<72/  + 

r  /  ,2  I  /  ^L-h  \ 

1  \  w  '.Q  »  t  (  .  t  w  » 

•  • 


I 


(18) 


L(s)  =  i  i~'/2  =  hal/width 

2 

L(s)  =  [L$q  f  +(^')2(^o‘7i+72)2]l/2 

For  convenience,  the  parameter  L0  will  sometimes  be  used  to  specify  the 

Zvq 

initial  beam-width  at  the  beam  waist.  L0  is  specified  by  Lu  =  — —  L0  , 

CJ 

and  is  independent  of  frequency  since  from(l8),  Lu  =  /.(sq-Sq)  =  Z)- 

The  properties  of  a  Gaussian  beam  in  a  homogeneous  medium  are 
briefly  reviewed  in  Appendix  2.  An  example  of  a  Gaussian  beam  in  a 

homogeneous  medium  with  Ly  =  l  and  X  =  is  shown  in  Figure  2.  The 

beam  waist  is  located  at  s  =  0  .  The  solid  lines  show  the  spread  of  the 
beam  with  distance  and  the  dashed  straight  lines  are  the  far  field 
asymptotes.  The  distance  sc  separates  the  columnated  near  zone  from 
the  diverging  far  zone  of  the  beam  and  is  proportional  to  the  initial  beam 
width  squared  over  the  wavelength.  The  near  zone  has  planar  phase  fronts 
while  the  far  zone  diverges  as  if  from  a  point.  By  adjusting  the  initial 
beam-width,  the  beam-width  observed  at  a  given  station  range  can  be 
changed.  Adjusting  the  initial  beam-width  so  that  the  station  distance  is 
just  sc  results  in  the  smallest  possible  beam-width  at  the  station.  This  will 
be  ca'Jed  r.he  ''r"'  ■  h<»arr.-wid*b.  ?cr  a  given  s'.Vior.  range  It  also 

places  the  static:  (us|  os.^een.  tr.e  columnated  plur.ur  part  of  the  beam 
and  the  far  field  spread.r.g  part.  For  a  homogeneous  medium  the  critical 


Figure 
X  : 


2  A  Gaussian  beam  in  a  homogeneous  medium  with  Lu 
rr/4. 
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initial  beam-width  is  Lu„  =  'AsT^-  For  the  general  inhomogeneous  case 

L„  (19) 

M#r  y  9 1 

[see  Cerveny  et.al.,  1982] 

An  alternative  way  to  generate  a  Gaussian  beam  in  a  homogeneous 
medium  is  to  use  an  initial  planar  Gaussian  amplitude  distribution  and 
diffract  it  using  the  Fresnel  diffraction  integral  [see  Marcuse,  1902].  Since 
the  Gaussian  beam  is  a  known  solution  to  the  Fresnel  integral  fcr  a  given 
initial  amplitude  distribution,  the  field  from  an  arbitrary  initial  amplitude 
distribution  can  be  approximated  as  a  sum  of  basis  Gaussian  beams. 


The  final  step  of  the  Gaussian  beam  method  is  a  superposition  of  beam 
solutions  to  represent  a  given  source  using  the  proper  weight  functions, 
$(<5),  where  <5  is  the  ray  index  parameter.  For  an  initial  line  source,  <5 
represents  an  initial  takeoff  angle  for  a  given  ray.  For  an  initial  plane 
wave,  <5  represents  the  distance  along  the  wavefront  for  a  given  ray.  Each 
beam  solution  solves  the  wave  equation  in  the  vicinity  of  a  ray,  and 
together,  the  weighted  superpostion  approximately  satisfies  a  source 
condition  for  a  given  source.  Thus,  the  following  integral  over  ray  index 
parameter  and  frequency  can  be  written 
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where  F{~)  is  the  .r-'c  spectra,  and  -r0  ,  6U  define  :h?  range  of  the  ra;. 
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index  parameter.  <50  and  iy  must  be  chosen  to  adequately  cover  the 
area  of  interest  with  rays.  Using  the  slowness  approach  [Chapman.  1970] 

u(x,t)=  'I‘*(6)g{x,t,6)d6  (21) 

where  g{£,t)  is  a  wave  packet  [see  Cerveny,  1983].  In  the  examples.  F(cj) 
is  the  source  spectra  for  the  following  damped  cosine  wavelet  [see  Cerveny 
et.al.,  1977] 

/(£)  =  exp(-(o£/7)2)cos(c;£  +?)  (22) 

Using  this  source  wavelet,  the  wave  packet  §{2,t,6)  is  approximately 
Gaussian  in  both  time  and  space  [Cerveny,  1903].  Equation  (21)  is  then 
approximated  by  the  finite  sum 

H(i.t)  =  £$(*i  )<?(*.*  A  )A3  (23) 

i  *0 

where  must  be  chosen  small  enough  to  smoothly  interpolate  the  beam 
solutions  and  to  adequately  sample  the  medium. 

The  weight  factors  $(<5)  for  a  given  source  must  then  be  specified. 
The  weight  function  fcr  an  acoustic  line  source  is  given  by  Popov  [  1982] 
and  Cerveny  et.  al.  [1982],  and  for  a  plane  wave  by  Cerveny  [1982].  In 
Appendix  3,  the  weight  function  for  a  2-D  point  force  in  an  elastic  medium 
is  obtained  by  comparing  ‘.he  Gaussian  beam  superposition  with  the  exact 
plane  wave  dec  c rr.pru  '.Jfef 


There  are  thus  several  free  parameters  when  computing  the  Gaussian 
beam  solution  for  a  given  problem:  <50,  and  A<5  for  the  finite  sum.  the 
initial  beam-width  at  the  beam  waist  Lu,  and  the  shift  of  the  beam  waist 
away  from  the  source  given  by  the  parameter  Sq.  The  sensitivity  of  the 
method  on  these  parameters,  particularly  Lu,  will  be  investigated  in  several 
of  the  following  examples. 


2.2.3  APPIJCATIONS 


In  this  section,  several  applications  of  the  Gaussian  beam  method  are 
given  in  order  to  illustrate  the  uses  of  the  method  and  test  its  range  of 
applicablitiy.  The  computer  program  for  the  Gaussian  beam  synthesis  was 
modified  from  the  ray  tracing  program.  RAY81.  written  by  l.  Psencik  [  1983]. 
For  all  examples,  the  slowness  method  is  used  in  which  the  frequency 
integral  is  evaluated  first  assuming  Gaussian  packets  and  a  damped  cosine 
source  wavelet  [Cerveny,  1983]. 


Ex  ample  A 


In  this  example,  an  initial  plane  wave  in  a  homogeneous  medium  is 


decomposed  into  a  number  of  centered  Gaussian  beams.  Each  beam  is 
propagated  along  the  straight  Line  rays  shown  in  Figure  3a.  The  initial 
plane  wa'fe  is  located  at  z=-5G  km.  The  station  is  shown  by  the  triangle  at 
x=25  km.  z=0  km.  The  velocity  is  6  km/sec  and  the  source  wavelet 
parameters  are,  /  =2  hz,  7=4.  <p0=Q  (see  eqn.  (22)  ).  The  contribution  of 
each  beam  to  the  resulting  sum  is  shown  for  3  different  initial  beam-widths 
in  Figures  3b-d.  The  initial  beam-width  is  represented  by  L0  .  a  frequency 

2v  1/2 

independent  parameter  where  £ <*=(-“-)  has  units  of  km  and  L0 

CJ 


has  units  of  km1'  2  For  this  example.  L^-SSLo 
rep»esenls  the  critic -.1  utilial  beam-width  '-hat  g 
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Figure  3  Superposition  of  Gaussian  beams  to  represent  an  initial  plane 
wave  in  a  homogeneous  medium,  a)  ray  diagram  for  initial  plane  wave 
at  z=-50.  km.  Individual  beam  contributions  indexed  by  ray  number 
along  the  wavefront  and  the  resulting  sum  for  three  initial  beam-widths 
b)  Z,0=  LOO  kmw2.  c)  L0=7  km'/2,  and  d)  Z>o=l  kmuz.  e)  The  amplitude 
of  the  Gaussian  beam  sum  as  a  function  of  Lq. 


width  at  the  station  range  of  50  km.  Larger  than  critical  initial  beam 
widths  result  in  planar  beams  at  the  receiver,  while  smaller  than  critical 
initial  beam-widths  result  in  diverging  phase  fronts  at  the  receiver  as  if 
from  a  point. 

The  'ray  number’  in  Figures  3b-d  represents  the  ray  index  parameter, 
which  for  this  example  measures  distance  along  the  initial  plane  wavefront. 
The  initial  ray  spacing  is  2  km  The  rays  in  the  ray  diagram  (Figure  3a)  are 
assigned  a  number  from  l  to  25.  The  beam  going  directly  to  the  station 
follows  along  ray  number  13.  The  scaled  sum  for  each  initial  beam-vridth  is 
shown  to  the  right  in  Figures  3b-d. 


For  £0=l  >vmw2 .  less  than  critical,  oniy  a  limited  number  of  beams 
around  the  direct  ray  contribute  to  the  resulting  sum  due  to  phase 
interferenc  e  (see  Figure  3d).  There  is  also  a  spurious  end  effect  in  the 
resulting  sum  caused  by  lack  of  cancellation  of  the  end  beams.  As  the 
initial  beam- width  is  changed  to  L0=7  kmuz  ,  near  critical,  again  only  a 
limited  number  of  beams  contribute  to  the  resulting  sum  (see  Figure  3c). 
This  is  now  caused  by  amplitude  decay  of  beams  away  from  the  station. 
The  spurious  end  effects  have  been  avoided  in  the  resulting  sum  by  using 
the  critical  initial  beam-width.  Finally  for  £,0=1-00  &m1/2  ,  larger  than 
critical  the  bean':  :re  mere  planar  at  the  station  dutar.ee  and  no 
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Figure  3e  shows  the  amplitude  of  the  resulting  sum  as  a  function  of 
initial  beam-width.  The  critical  initial  beam-width  is  shown.  The  true 
amplitude  at  the  receiver  is  one.  For  larger  than  critical  initial  beam- 
widths,  the  amplitude  uf  the  Gaussian  beam  sum  falls  oft  since  a  finite 
aperture  of  rays  is  used. 

The  following  comments,  can  be  made  concerning  decomposing  an 
initial  plane  wave  in  a  homogeneous  medium  into  Gaussian  beams.  For 
larger  than  critical  initial  beam-widths,  the  amplitude  is  not  stable  using  a 
finite  ray  aperture.  For  near  critical  initial  beam-widths,  the  result  is 
stable  by  amplitude  decay  of  beams  away  frorv'the  receiver.  The  end 
effects  are  avoided  here  by  amplitude  decay.  smaller  than  critical 
initial  beam-widths,  a  stable  result  occurs  by  phase  interference,  but 
spurious  end  effects  are  noticeable  in  the  resulting  sum.  This  is  similar  to 
a  decomposition  of  the  initial  wavefrcnt  into  point-like  sources  as  viewed 
at  the  receiver 


Example  B 

In  this  ss;.arr.?,e  an  initial  half  plane  is  decomposed  trto  Gaussian 
beams.  The  ha  if  plane  s  lo  elite  a  at  a  d<jp>.h  of  2  5  km  and  terrmnaleo  a1 
x=!5  krr.  as  seen  ;n  r,j urt  ’a  Receivers  are  located  on  the  surface  from 
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12  t'o  18  km  with  a  spacing  of  0.2  km.  The  velocity  is  2.5  km/sec  and  the 
source  wavelet  parameters  are,  /  =40  hz ,  7=4.  ^q=7t/  2  . 

The  initial  beam-width,  i».  is  related  to  L0  by  Lu  =  0.2£,0  .  The 
critical  value  of  L0  for  a  receiver  range  of  2.5  km  is  Loer='/s  =1.58  kmuz  . 
Figure  4b  shows  the  resulting  seismograms  for  A0=l.5  kmx/ *  .  Since  the 
initial  amplitude  on  the  edge  of  the  half  plane  goes  to  zero  gradually  ever 
0.3  km  the  amplitude  at  the  receivers  smoothly  goes  to  zero  at  x=l5  km 
with  no  evident  edge  diffraction.  When  Lq  is  reduced  to  05fcm1/2,  the 
initial  amplitude  on  the  edge  of  the  half  plane  goes  to  zero  much  more 
sharply.  A  diffracted  wave  from  the  edge  of  the  half  plane  can  now  be  seen 
(Figure  4c). 


Example  C 

In  this  e.  mple,  the  decomposition  of  a  line  source  in  a  homogeneous 
medium  into  Gaussian  beams  is  investigated.  Caen  beam  is  propagated 
along  the  rays  shown  in  the  ray  diagram  in  Figure  5a.  for  a  source  at  x=25 
km  z=-50  km  The  station  at  which  the  resulting  sum  is  evaluated  in  shown 
by  the  triangle  located  at  x=25  km  z=0  km.  The  velocity  is  6  km/scc  and 
the  source  wavelet  parameters  are,  /  =2  h.z,  y=4.  The  contribution 

of  each  beam  :o  t~.«  resulting  sum  at  the  station  •*  shown  for  three  initial 
beam-widths  tn  Figures  5b-d.  The  initial  beam-width  is  represented  by  L0 
where.  L;/=98£n  "The  -riy  number'  corresponds  to  the  ray  index 
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parameter  which  for  this  case  is  the  initial  takeoff  angle  at  the  source.  The 
rays  in  the  ray  diagram  (Figure  4a)  are  assigned  a  number  from  l  to  18. 
The  beam  going  directly  to  the  station  is  represented  by  ray  number  10. 
The  scaled  .sum  for  each  initial  beam-width  is  shown  to  the  right  in  Figures 
5b-d. 


The  initial  beam-width  of  Z,0=lO  kmx/z  is  slightly  larger  than  the 
critical  value  of  £0=7  .  For  a  given  station  distance,  LQ  equal  to 

critical  separates  planar  beams  from  the  beams  that  diverge  as  if  from  a 
point.  For  £0=lOO  kmx/z  ,  larger  than  critical,  only  a  limited  number  of 
beams  contribute  around  the  direct  ray  due  to  phase  interference  (see 
Figure  5b).  A  noticeable  end  effect  from  the  finite  ray  aperture  can  be 
seen  in  the  sum.  For  Lq-10  kmx/z  ,  again  only  a  limited  number  of  rays 
about  the  direct  ray  contribute  to  the  sum  (see  Figure  5c).  This  is  now  due 
to  amplitude  decay  of  beams  away  from  the  station.  The  end  effects  have 
been  avoided  in  this  case.  For  £,< j=l  km'/z  ,  smaller  than  critical,  neither 
amplitude  or  phase  interference  limits  the  number  of  beams  contributing 
about  the  direct  ray  (see  Figure  5d).  The  result  then  will  depend  on  the 
ray  aperture  used. 


The  amplitude  of  the  Gaussian  beam  seismogram  as  a  function  of 
initial  beam-width  ic  shewn  tn  Figure  5e.  The  amplitude  ci  1.0  in  Figure  5s 
corresponds  to  the  amplitude  of  the  far  held  approximation  for  a  line 
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Figure  5  Superposition  of  Gaussian  beams  to  represent  a  line  source  in  a 
homogeneous  medium  a!  ray  diagram  for  line  source  at  x=25.,  z=-50 
Individual  beam  contributions  indexed  by  ray  number  representing 
initial  takeoff  angle  anti  the  resulting  sum  for  b)  L0=]  00,  kmuz,  o' 

L0=10.  km.  ,  and  d)  A0=l.  kmuz  e)  The  amplitude  of  the  Gaussian 
beam  sum  as  a  functon  of  L0 
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|  U(D)\  =  .01949  .  The  amplitude  of  the  Gaussian  beam  sum  for 
Z,0=lO  kmwz  is  !  ilGB |  =  .01958.  For  larger  than  critical  initial  beam- 
widths,' the  amplitude  of  the  Gaussian  beam  synthesis  is  stable  within  1/5  of 
the  far  field  line  source  solution.  For  smaller  than  critical  initial  beam- 
widths,  the  amplitude  of  the  Gaussian  beam  sum  falls  off. 

The  following  conclusions  can  be  made  concerning  decomposing  a  line 
source  into  Gaussian  beams.  For  larger  than  critical  L0  ,  a  stable  result 
occurs  by  phase  interference.  This  is  similar  to  a  decomposition  of  a  line 
source  into  plane  waves.  Spurious  end  effects  occur  for  this  case  due  to 
the  finite  ray  aperture.  Using  the  critical  Lq  ,  a  stable  result  occurs  by 
amplitude  decay  and  end  effects  are  avoided.  Finally  for  smaller  than 
critical  Lq  ,  the  amplitude  is  not  stable. 


Example  D 


In  this  example,  an  Sri  line  source  above  a  -velded  interface  .s 
investigated.  The  velocities  are  3. 7  km/sec  over  4.6  km/sec  with  the 
source  and  receivers  located  30  km  above  the  interface.  The  receivers 
range  from  20  km  to  200  km  with  a  spacing  of  20  km  The  model  is  shown  in 
Figure  6a. 


Figure  5b  shews  :  r.e  seismograms  computed  using  the  Cagmuri 
method  [.see  Ala  and  P. shards.  1900].  The  Cugniard  result  is  convolved  with 


a  .source 


y^yglgt.  with  parameters  J  —3  h, z,  y  “4,  •  ~iid  _  the 

seismograms  are  plotted  in  reduced  time,  T— A/ 4.6  .  The  precritical 
reflection  coefficient  goes  to  zero  near  60  km  and  then  changes  sign.  The 
critical  distance  is  located  at  81.2  km,  and  the  amplitude  becomes  large  at 
a  distance  slightly  greater  than  the  critical  distance.  For  post-critical 
distances,  total  reflection  occurs  with  a  phase  shift.  A  small  head  wave  can 
also  be  seen  emerging  past  the  critical  distance.  The  Gaussian  beam  result 
using  a  larger  than  critical  initial  beam-width,  £c=75  km^z  ,  is  shown  in 
Figure  6c.  The  Gaussian  beam  result  is  similar  to  the  Cagniard  result, 
including  the  head  wave  phase.  But,  there  also  exists  a  spurious  end  effect 
cutting  across  the  Gaussian  beam  seismograms  from  about  6  seconds  at 
100  km  to  about  12  seconds  at  200  km. 

The  Gaussian  beam  result  for  three  initial  beam-widths  is  shown  in 
Figures  7a-c.'  The  value  of  Lq  equal  to  20  is  slightly  larger  than  critical 
for  the  farthest  station.  For  £0=  20  kmwz,  the  small  head  wave  phase  is 
not  generated,  but  the  other  phases  compare  with  the  Cagniard  result.  For 
Larger  Lq  ,  the  head  wave  begins  to  form  In  addition,  an  unwanted  end 
effect  begins  to  appear. 


Example  E 

In  this  example,  ’.he  effect,  of  a  single  caustic  in  a  layer  over  a  gradient 
is  reviewed.  This  is  a  standard  example,  but  illustrates  several  features  of 
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the  Gaussian  beam  solution  [see  Cerveny  et.  al..  1977;  Cerveny  et.  al..  1982; 
Klimes  and  Cerveny,  1983].  The  velocity  model  is  constant  to  15  km  with  a 
velocity  of  5.6  km/sec  and  increases  linearly  from  15  km  down  to  40  km 
with  a  velocity  of  8  km/ sec.  The  ray  diagram  is  shown  in  Figure  8a  using 
36  rays  and  a  ray  spacing  of  l  degree.  The  wavelet  parameters  are 
/  =4  hz,  7=3,  p0=0.0  . 

A  caustic  intersects  the  surface  at  L20  km.  Ray  theory  would  predict 
an  infinite  amplitude  at  the  caustic  and  no  energy  in  the  caustic  shadow 
The  Gaussian  beam  seismograms  are  shewn  in  Figure  8b-c  for  two  initial 
beam-widths,  LQ=Qkml/z  near  the  average  critical  value  for  the  set  of 
rays,  and  Z,©  =  128  fcml/2  So  is  chosen  to  be  0.0.  locating  the  beam-waist 
at  the  source.  The  Gaussian  beam  seismograms  -how  several  interesting 
features  including,  the  shift  of  the  peak  amplit  de  to  the  right  of  the 
caustic  location  and  the  penetration  of  energy  into  the  caustic  shadow.  As 
a  result  of  phase  interference  between  beams,  the  t,0=L29  kml/z  docs  not 
penetrate  any  further  into  the  caustic  shadow  than  the  Lq=B  kml/z 
solution.  But,  spurious  phases  can  be  seen  in  the  £.o  =  128  kml/z  solution 
which  don’t  quite  cancel  with  the  ray  set  used. 
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The  results  for  a  station  at  14C  km  are 


are  9a  shows  the  ray  diagram  using  18  rays  and  a 
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ray  -spacing  of  2  degrees  The  station  is  located  by  the  triangle,  and  two 
geometric  arrivals  are  seen  to  arrive  at  this  station.  Figures  9b-c  shows 
the  individual  beam  contributions  for  L0=Bkml/z  and  £0=128  kmwz . 
The  'ray  number’  represents  the  ray  index. parameter.  The  rays  are  here 
simply  indexed  from  l  to  18  from  a  shallow  initial  takeoff  angle  to  a  steep 
Initial  takeoff  angle  from  horizontal.  The  two  geometric  “rivals  are  seen 
in  the  scaled  sum  to  the  right  in  Figures  9b-c. 

For  £<}= 8  lcml/z  ,  near  the  average  critical  value,  the  energy  is 
concentrated  around  the  two  direct  beams  to  the  station  with  ray  numbers 
4  and  17.  The  individual  beam  wavelets  in  Figure  9b  map  out  a  curve  which 
has  stationary  points  corresponding  to  the  direct  beams  to  the  station. 
For  Z,0=t28  krn]/z ,  the  energy  is  more  evenly  distributed  among  the 
beams  as  seen  in  Figure  9c  The  beam  solutions  away  from  the  stationary 
points  now  phase  interfere  giving  a  similar  resulting,  sum  as  for  the 
Lq=8  kmx/z  case. 

For  very  large  Lq  .  each  beam  solution  is  effectively  an  initial  plane 
wave  contribution.  For  a  large  Lq  and  a  one  dimensional  medium,  the 
individual  beam  contributions  in  phase  time  and  ray  number  are  similar  to 
plane  wave  decompositions  m  phase  time,  0  =  r  +  px  ,  and  horizontal  ray 
parameter  [see  Charmer.  ID'S] 

Figure  10  snows  the  results  for  a  station  at  x=l20  km  where  the 
caustic  intersects  the  surface  Here  the  two  geometric  arrivals  merge  as 
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one.'  Figure  10b  shows  the  individual  beam  contributions  for  L0  =  Qkml/Z 
Again  the  'ray  number’  represents  the  ray  index  parameter  and  indexes 
the  rays  from  l  to  18.  There  is  now  only  one  stationary  point  in  the  curve 
represented  by  the  locus  of  individual  beam  wavelets,  located  at  ray 
number  10.  The  result  for  £o=l20  fen 1/2  is  shown  in  Figure  10c.  The  sum 
shows  a  spurious  end  effect  caused  by  the  lack  of  cancellation  of  the  end 
beams.  The  L0=&  kmx/i  case  avoids  this  end  effect  by  amplitude  decay  of 
beams  away  from  the  station. 

In  Figure  11.  the  beam  contributions  are  shown  for  a  station  at  x=110 
km  in  the  caustic  shadow.  Here  no  geometric  arrival  occurs  and  the 
individual  beam  wavelets  shown  in  Figures  llb-c  have  no  stationary  points. 
The  £0=8fcml/2  case  shows  a  clean  resulting  sum,  whereas  the  the 
£.0= 128  1/2  case  show*  spurious  end  effects  which  are  the  same  order  as 

the  actual  signal. 

Example  F 

In  this  example,  the  effect  of  caustics  on  q(s)  is  investigated.  The 
caustics  result  from  multiple  bounces  of  rays  in  a  surface  waveguide.  The 
velocity  model  consists  of  a  0  l  km  homogeneous  surface  layer  having  a 
velocity  of  1  8  km/ sec  over  a  linear  velocity  gradient  from  1.8  km-'sec  to 
5.6  krn/sec  at  5  0  km.  The  ray  diagram  for  a  source  m  the  thin  surface 
layer  is  shown  m  Figure  i 2a.  The  complex  function  qr[s),  is  shown  m 
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Figure  12b  as  a  function  of  distance  along  the  ray.  qr(s)  is  a  relative 
.spreading  across  each  interface  and  for  this  case  is  related  to  q(s)  by 

qr(s)  =  (-l)afg(s)  =  (-0a,,fc7i(s)  +  <7z(s)l 
where  ar  is  the  number  of  reflections  from  an  interface.  qr(s)  is  the 

actual  spreading  factor  which  occurs  in  the  amplitude  of  the  beam  [see 

Cerveny  and  Psencik,  1983c].  For  simplicity,  the  parameter  c  is  set  to 

-i  .  The  horizontal  scale  for  gr(s)  is  ten  times  larger  than  the  vertical 

scale. 

For  ray  theory,  caustics  occur  when  g2(s)  =  /?leaZ(g(s))  goes  to  zero. 
Several  zero  crossings  for  Real(qr(.s ))  are  shown  in  Figure  12b,  for 
example,  between  points  b  and  c.  By  combining  the  results  for  an  initial 
plane  wave  and  an  initial  line  source,  the  magnitude  of  the  complex 
function  qr{s)  never  equals  zero.  In  Figure  12b.  '.he  plot  of  qr(s )  cycles 
counterclockwise  about  the  origin,  but  never  passes  through  it. 

Using  ray  theory,  a  phase  advance  of  tt/2  must  be  introduced  at  each 
caustic.  This  is  dene  using  t.he  KMAH  index.  Using  a  complex  qr(s),  the 
phase  of  qr{s)  automatically  changes  by  n  between  zero  crossings  of 
reai(gr(s))  .  Thus,  as  long  as  the  phase  of  qr(s)  is  kept  continuous, 
taking  a  square  root  results  in  the  proper  nmber  of  tt/  2  phase  shifts. 

Example  G 
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In  this  example  the  effect  of  the  scale  of  a  velocity  inhomogeneity  on 
the  Gaussian  beam  solution  is  investigated.  A  single  heterogeneity  is 
considered  with  an  incident  plane  wave  impinging  from  below.  The 
background  velocity  is  8  km/sec  and  the  wavelet  parameters  are. 
/  =1  hz,  7=4.  <Pq—0  .  The  velocity  heterogeneity  is  circular  and  the  velocity 
varies  smoothly  in  a  Gaussian  manner.  The  radius  of  the  heterogeneity 
ranges  from  15  to  25  km  with  a  velocity  contrast  of  10%  lower  than  the 
background.  The  Gaussian  beam  solution  is  then  compared  with  the  finite 
difference  solution  of  the  45  degree  parabolic  equation  typically  used  in 
seismic  exploration  and  forward  scattering  simulations  [see  Claerbout, 
1976].  The  ray  diagram  for  a  25  km  radius  heterogeneity  is  shown  in 
Figure  13. 

Figure  14  shows  the  seismograms  for  an  inhcmogeneity  with  a  radius 
of  25  km.  a  velocity  contrast  of  10%,  a  distance  from  the  initial  wavefront 
to  the  heterogeneity  of  50  km.  and  a  distance  from  the  initial  wavefront  to 
the  stations  of  120  km  The  finite  difference  solution  of  the  parabolic 
equation,  Farab45,  is  shown  in  Figure  14a.  Figures  I4b-d  show  the 
Gaussian  beam  solutions  for  three  initial  beam-widths.  The  critical  initial 
beam-width  for  the  station  distance  in  a  homogeneous  medium  is 
L0  =  1 1  fern 1/2  From  the  results  of  Example  A,  the  decomposition  of  an 
initial  plane  wave  m  a  homogeneous  medium  requires  initial  beam-widths 
comparable  or  smaller  than  the  critical  initial  beam-width  for  the  receiver 
distance.  The  maximum  amplitude  for  each  solution  is  shown  in  Figure  14 
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Figure  13  Ray  diagram  for  a  single  heterogeneity  with  a  radius  of  25  km 
and  a  velocity  contrast  of  10%. 


»  r u?. vm yjtvv v'tx vv *«  *» l*v  -*%  , v vv m//- jvlw vi l. v r~o.vi .w  rvi \ 


-  rj*  r  jr  .lit  •  « 


SINGLE  INHOMOGENEITY 


60 


Figure  14  Single  velocity  heterogeneity  with  a  radius  of  25  km.  a  velocity 
contrast  of  10%.  the  distance  from  the  initial  wavefront  to  the 
heterogeneity  of  50  km.  and  the  distance  from  the  initial  wavefront  to 
the  stations  of  140  km.  a)  Parab45  seismograms  Gaussian  beam 
seismograms  for  b)  £0=lO  km}/z,  c)  L0=7  km.l/z,  and  d)  L0=4  kmuz. 
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The  Gaussian  beam  results  for  the  different  initial  beam-widths  are  seen  to 
be  stable  and  compare  with  the  Parab45  result. 

Figure  14  also  shows  the  beam-width  in  km  at  the  inhomogeneity 
distance  for  each  initial  beam-width  solution.  The  smallest  beam-width  at 
the  heterogeneity  distance  is  14  km  resulting  from  an  initial  beam  width  of 
L0=7  Jfcml/3,  and  is  smaller  than  the  heterogeneity  radius.  From  example 
A,  this  smallest  beam-width  defines  the  zone  of  contributing  beams  to  the 
final  sum.  For  smaller  than  critical  initial  beam-widths  a  similar  limited 
region  is  defined  by  phase  interference  about  the  stationary  direct  arrival 
and  is  related  to  the  first  Fresnel  zone. 

Figure  15  shows  the  same  experiment,  except,  now  the  initial  wavefront 
is  140  km  from  the  heterogeneity  and  210  km  from  the  stations.  The 
Parab45  result  is  shown  in  Figure  15a,  and  the  Gaussian  beam  seismograms 
for  four  initial  beam- widths  are  shown  In  Figures  15b-e.  Due  to  the  spread 
of  the  beams  with  distance,  the  beam-widths  at  the  heterogeneity  are 
larger  than  the  previous  case.  The  smallest  beam-width  at  the 
heterogeneity  distance  is  23  km  and  is  just  slightly  smaller  than  the 
heterogeneity  radius.  The  Gaussian  beam  results  are  still  stable  and 
compare  with  the  Parab45  result.  The  Lq-A  km 1/2  result  has  a  much 
larger  beam-width  of  50  km  at  the  heterogeneity  distance  but  the  number 
of  beams  contributing  is  now  limited  by  phase  interference.  The  amplitude 
for  this  case  is  fairly  stable  but  a  spurious  secondary  phase  can  be  seen. 


In  the  final  case,  the  inhomogeneity  radius  is  reduced  to  15  km  with  a 
velocity  contrast  of  10%,  a  distance  from  the  initial  wavefront  to  the 
heterogeneity  of  30  km,  and  a  distance  from  the  initial  wavefront  to  the 
stations  of  2L0  km.  The  Parab45  result  is  shown  in  Figure  16a.  The 
Gaussian  beam  results  are  shown  in  Figures  I8b-e  for  several  initial  beam- 
widths.  For  this  case  the  Gaussian  beam  amplitudes  are  less  stable.  The 
Gaussian  beam  solution  with  the  smallest  beam-width  at  the  heterogeneity 
compares  best  with  the  Parab45  result.  All  the  Gaussian  beam  solutions 
have  a  larger  secondary  arrival  than  the  Parab45  solution.  This  may  in 
part  be  due  to  dip  filtering  in  the  Parab45  formulation. 

These  preliminary  results  indicate  that  when  using  the  Gaussian  beam 
method  in  a  heterogeneous  medium,  the  critical  beam-width  be  smaller 
than  the  heterogeneity  scale.  This  is  more  restrictive  than  simply  small 
wavelength  with  respect  to  heterogeneity  scale,  and  is  related  to  the 
approximation  made  in  the  final  term  in  equations  (5)  or  (7).  This 
restriction  is  similar  to  that  given  for  geometric  optics  by  Kravtsov  and 
Orlov  [IS80]  m  which  the  medium  and  wave  parameters  must  be  slowly 
varying  over  the  first  Fresnel  zone. 


Example  H 

In  this  example.  511  waves  m  a  2-D  basin  structure  are  investigated. 
This  problem  has  been  studied  by  a  number  of  investigators  using 
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techniques  including;  finite  difference,  finite  element,  discrete 
wavenumber,  and  Glorified  Optics(a  ray  method)  [Boore  et.  al.V  1971;  Hong 
and  Helmberger,  1977;  Bard  and  Bouchon,  I960;  Aki  and  Richards,  i960]. 
The  basin  structure  varies  in  thickness  from  t  km  at  the  edge  to  6  km  in 
the  center  over  a  range  of  25  km  The  interface  has  a  cosine  shape.  The 
material  parameters  in  the  basin  are  v^.7  km/ sec  and  Pi=2.0  gm/cm 3 
and  the  material  parameters  of  the  basement  are  u2=3.5  km/ sec  and 
p2=  3.3  gm/cm 3  . 

Figure  17  shows  the  ray  diagrams  for  the  direct  wave  and  the  first  2 
bounces.  The  rays  represent  an  upward  traveling  SH  plane  wave  normally 
incident  from  below.  The  Gaussian  beam  synthesis  was  made  up  of  t2 
bounces.  The  incident  plane  wave  was  expressed  by  equally  spaced  rays  in 
the  basement.  No  two-point  ray  tracing  was  required;  the  rays  were  simply 
allowed  to  reverberate  in  the  basin.  For  this  case,  150  rays  were  used  with 
a  ray  spacing  of  l  km 

The  damped  cosine  source  wavelet  used  in  the  Gaussian  beam 
synthesis  (see  eqn.  21)  was  adjusted  to  simulate  an  often  used  Ricker 
wavelet  [see  F^rd  and  Bouchon,  1980).  In  Figure  10,  the  solid  line  is  the 
Ricker  wavelet,  and  the  dashed  line  is  the  damped  cosine  wavelet  used  in 
the  Gaussian  beam  synthesis  with  7=16.5  sec  and  7=3.2  The  value  of 
L0  was  set  at  2.0  <c7nw' 
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Figure  p  Ray  diagrams  for  an  SH  plane  wave  incident  on  a  2-D  basin 
structure.  The  direct  wave  and  first  2  bounces  are  shown. 


Figure  18  Comparison  between  a  Ricker  wavelet  (solid  line)  used  in  othe 
studies  of  the  soft  oaftn.  and  the  damped  cosine  wavelet  (dotted  line 
used  for  the  Gaussian  beam  seismograms  with  T=l6.5  sec  and  7=3.2 


Figure  19  Comparison  of  seismograms  computed  by  various  methods  for  a 
soft  basin,  DW  •  :  .-Crete  wavenumber,  GO  -  Glorified  Optics  (a  ray 
method),  FE  -  c  e.ement.  GB  -  Gaussian  beam.  The  center  of  the 
basin  is  marked  a?  J  km.  and  the  edge  as  20  km. 


Figure  19  shows  a  comparison  of  four  different  methods  for  the  2  D 
basin  structure.  Thu  Gaussian  beam  seismograms  have  been  added  to 
Figure  13.26  in  Aki  and  Richards  [i960].  The  center  of  the  basin  is  marked 
as  0  km.  and  the  edge  of  the  basin  as  20  km.  For  each  distance,  the 
Gaussian  beam  seismogram  is  at  the  bottom  The  second  phase  on  the 
seismograms  represents  energy  propagating  across  the  basin.  The 
seismograms  calculated  by  the  Gaussian  beam  method  compare  well  with 
the  other  solutions  at  the  earlier  portions  of  the  seismograms. 
Discrepancies  among  the  various  methods  are  evident  in  the  latter 
portions  of  the  seismograms.  This  may  be  due  to  the  fact  that  both  the 
Gaussian  beam  method  and  Glorified  Optics  are  high-frequency  methods, 
while  the  discrete  wavenumber  method  is  appiicah.e  over  a  broader  range 
of  frequencies. 


Example  I 


Tn  this  example,  the  Gaussian  beam  method  is  applied  to  the  study  of 
volcanic  earthquakes  under  Mt.  5t.  Helens.  During  the  summer  of  1981.  a 
9-station  array  was  deployed  on  the  flanks  and  within  the  crater  of  the  Mt. 
St.  Helens  volcano  This  was  a  joint  effort  undertaken  by  Oregon  state 
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observations  of  the  recorded  high-frequency  volcanic  events  were.  1)  the 
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arrival  times  at  the  crater  station  were  between  0.4  and  0.7  seconds  earlier 
than  those  at  the  flank  stations,  and  2)  the  amplitudes  at  the  crater 
station  were  about  10  times  greater  than  those  recorded  at  the  flank 
stations. 


Figure  20  shows  the  2-D  model  of  the  velocity  structure  which  was 
tentatively  adopted  for  Mt.  St.  Helens.  This  model  is  based  on  evidence 
obtained  by  various  workers  on  Mt.  St.  Helens  [Malone,  1982]  as  well  as  on 
other  similar  types  of  volcanoes  such  as  St.  Augustine  in  Alaska  [Kienle  et. 
al..  1979]  and  Showa-shinzan  in  Japan  [Hayakawa.  1957].  In  all  these  cases, 
the  internal  structure  of  a  volcano  is  depicted. as  a  high-velocity  central 
body  with  a  P  velocity  of  about  4  km/sec  covered  by  lower-velocity  surflcial 

layers. 


It  was  first  thought  that  the  observed  high  amplitude  and  early  arrival 
time  could  be  explained  by  locating  the  earthquake  foci  at  the  bottom  of 
the  hypothesized  high-velocity  column  Figure  20  shows  the  ray  diagram 
for  a  focus  located  3  5  km  below  the  summit.  The  high-velocity  column 
tends  to  cause  energy  to  diverge  from  the  summit.  Vertical  component 
Gaussian  beam  seismograms  for  stations  located  across  the  volcano  from 
2.5  km  to  7.5  km  along  the  surface  are  shown  in  Figure  21  using 
frequencies  of  5  :  2  an 4  20  H 7  The  amp'itudes  at  the  flank  stations 
appear  to  be  comparable  to  those  at  the  crater,  and  'he  differences  in 
arrival  time  are  vvithiii  0  25  seconds. 
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Figure  20  Velocity  model  used  to  depict  the  structure  under  Mt.  it 
Ray  diagrams  Ar°  -h  below  for  a  source  located  3  5  km  ben.1 

summit  and  a  source  located  1.0  km  below  the  summit. 
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If  the  focal  depth  is  moved  upward  to  l  km  below  the  summit,  the 
seismic  signal  changes  dramatically.  The  ray  diagram  for  a  focus  l  km 
below  the  summit  is  shown  in  Figure  20.  In  this  case,  the  ray  paths  going 
directly  upward  are  dense  and  short,  while  those  toward  the  flank  of  the 
volcano  are  sparse  and  take  a  detour.  The  resultant  seismograms  for  5,  10, 
and  20  Hz  are  shown  in  Figure  22.  These  show  an  amplitude  at  the  crater 
station  an  order  of  magnitude  greater  than  at  the  flank  stations,  and 
about  0.5  sec  earlier  arrival,  in  agreement  with  observation. 

In  order  to  check  the  accuracy  of  the  computed  seismograms, 
reciprocal  seismograms  were  compared  in  Figure  23.  In  case  A-,  a  2-D 
vertical  point  force  is  applied  at  a  point  3.5  km  beneath  the  summit,  and 
the  receiver  is  on  the  flank  of  the  volcano.  In  case  B.  the  vertical  point 
force  is  on  the  flank  of  the  volcano  and  the  receiver  is  located  3.5  km 
beneath  the  summit.  The  free  surface  is  neglected  here.  The  vertical 
component  seismograms  are  shown,  for  both  cases,  for  frequencies  of  5, 
10,  and  20  Hz.  There  is  a  satisfactory  agreement  in  waveform  for  all  three 
frequencies.  The  amplitudes  agree  to  within  1%  for  20  Hz,  6%  for  10  Hz. 
and  12/5  for  5  Hz,  showing  better  accuracy  for  higher  frequencies,  as 
expected. 

Thus,  the  D-v<?!cc:ty  structure  depicted  m  c*ig-:ro  2C  and  focal  depths 
less  than  i  km  from  the  summit  can  explain  the  two  significant 
observations.  A  similar  observation  was  made  for  5t  Augustine  volcano  by 
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Figure  21  Gaussian  beam  seismograms  tor  a  source  located  3.5  km  below 
the  summit  using  three  frequencies;  o,10.  and  20  hz. 


‘WS 


74 


SHALLOW  SOURCE 
(1.0  km) 


flgur*  22 


Figure  22  Gaussian  beam  seismograms  for  a  source  located  1.0  km  below 
the  summit  using  three  frequencies:  5,10,  and  20  hz. 
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Figure  23  Ray  diagrams  and  seismograms  shown  for  reciprocal  cases,  A 
and  B.  The  reciprocal  seismograms  are  shown  for  three  frequencies; 
5,10,  and  20  hz. 
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Lalla  and  Kienle  [1982],  who  found  a  correlation  between  the  amplitude 
ratio  of  summit  to  flank  stations  and  the  focal  depth.  For  focal  depths 
shallower  than  about  2  km,  the  summit  amplitude  was  greater  by  an  order 
of  magnitude  than  the  flank  amplitude,  and  for  deeper  events,  the  opposite 
was  observed. 

In  any  case,  the  sensitivity  of  the  seismic  signal  to  the  focal  depth  is 
strong,  due  to  the  heterogeneous  structure  under  a  volcano.  This  result 
also  suggests  a  promising  way  of  accurately  locating  volcanic  earthquakes 
using  variations  in  amplitude  as  well  as  in  arrival  time. 


Example  J 

In  this  final  example,  the  Gaussian  beam  meihod  is  used  to  study  the 
influence  of  small-scale  fluctuations  on  teleseismic  P-waveforms.  The 
models  consist  of  several  realizations  of  a  lithosphere  with  randomly 
fluctuating  velocities.  The  layer  thickness  is  12C  km  with  a  correlation 
length  of  15  km.  The  average  velocity  is  8  km/sec  and  the  root  mean 
square  fluctuation  is  3%.  roughly  representing  the  results  obtained  for 
LASAfAki.  1973;  Capon.  1974;. 

Normally  distributed  random  velocities  were  used  to  construct  a  2-D 
velocity  grid  with  c  '  5-.-<m  spacing  between  mesh  points.  This  was  then 
interpolated  using  hi.'.ib’c  splmss  to  give  continuous  velocities  as  well  as 
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first  and  second  derivatives.  The  first  realization  for  the  randomly 
fluctuating  velocities  is  shown  in  Figure  24.  Above  is  shown  a  contour  plot 
of  the  random  velocities  contoured  at  0.  L  km/sec  interval.  Below  is  shown 
a  printer  plot  of  the  random  velocities  where  each  integer  on  the  plot 
represents  a  0.3  km/  sec  velocity  interval  and  the  integer  ''4"  represents 
the  velocity  interval  7  7  to  0.0  km/sec. 

Figure  25  shows,  the  ray  diagram  for  a  vertically  incident  plane  wave 
perturbed  by  the  random  velocity  layer.  The  receivers  are  located  just 
above  the  random  layer  at  z=75  km  The  range  of  receivers  is  from  125  to. 
275  km.  spaced  at  5  km  intervals.  Figure  26  shows  the  resulting  wavefleld 
for  two  dominant  frequencies,  2  IIz  and  l  IIz.  The  amplitude  and  phase 
have  both  been  significantly  distorted  by  the  random  velocity  layer.  A 
multipath  effect  can  also  be  seen  in  the  range  from  130  to  160  km  No 
significant  P  coda  with  a  duration  beyond  about  3  seconds  has  developed 
as  a  result  of  the  random  velocity  layer.- 

In  order  to  check  the  accuracy  of  the  method  for  this  example, 
seismograms  from  reciprocal  geometries  have  been  compared  using  a  2-D 
point  force  as  a  source.  The  first  comparison,  shown  in  Figure  27,  is  for  a 
source  and  receiver  300  km  above  and  below  the  random  velocity  layer.  In 
this  geometry,  -.re  rays  'or  the  reciprocal  cases  sample  the  same  region  of 
the  random  layer  There  is  a  good  agreement  m  waveshape  between 
reciprocal  cases.  Also,  the  amplitudes  agree  within  5”  For  this  case,  the 
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random  velocity  layer 
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Figure  24  Contour  plot  and  printer  plot  of  a  layer  with  randomly 
fluctuating  velocities  The  average  velocity  is  8  km/sec  with  a  root 
mean  square  fluctuation  of  3%.  The  correlation  length  is  15  km  The 
contour  interval  is  l  km/sec  for  the  contour  plot,  and  .3  km/sec  for 
the  printer  plot.  iseed=0. 
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Figure  27  Ray  diagrams  and  seismograms  for  the  shown  reciprocal  cases  A 
and  B.  For  case  A.  the  source  is  at  (200,800)  and  the  receiver  is  at 
(200,80).  For  case  B.  the  source  is  at  (200.80)  and  the  receiver  at 
(200,800).  The  random  layer  velocities  are  shown  in  Figure  24  "  The 
seismograms  a^e  computed  for  two  frequencies  l  and  2  hz! 
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parameter,  50  .  was  specified  to  put  the  minimum  beam  width  (beam  waist) 
just  before  the  ray  entered  the  random  layer,  and  L0  was  specified  as 
12.  kmwz. 

In  addition  to  reciprocity  checks,  the  calculation  was  repeated  using 
the  finite  difference  solution  of  the  45  degree  equation  [Claerbout,  1976]. 
Figure  ?8  shows  a  comparison  between  the  parabolic  result,  Parab45,  and 
the  the  Gaussian  beam  result  for  the  1  hz  case.  The  results  are  similar  in 
the  significant  details. 

In  Figure  29.  a  second  realization  for  the  random  velocity  layer  is 
shewn.  The  perturbed  rays  for  an  incident  plane  wave  are  shown  in  Figure 
30.  The  wavefields  observed  on  the  surface  of  the  random  velocity  layer 
for  the  two  dominant  frequencies  of  2  Hz  and  l  Hz  are  shown  in  Figure  31. 
This  realization  perturbs  the  wavefteld  less  than  the  previous  realization. 
The  results  have  been  checked  using  reciprocity  and  compared  against  the 
Parab45  solution  and  showed  good  agreement.  For  this  realization,  a 
comparison  has  been  made  between  the  ln_amplitude  and  the  residual 
phase  delay  This  is  shown  in  Fig -ire  32.  The  positive  correlation  between 
ln(A)  and  the  phase  delay  residual  is  in  agreement  with  observations  of 
teleseismic  P  waves  at  the  Montana  LASA  by  i-'ki  [1973] 


Figure  31  Wavefield  cf  vertically  incident  plane  wave  propagated  through 
random  layer  shown  in  Figure  29  Two  frequencies  are  used.  1  and  2  hz. 


2.2.4  CONCLUSIONS 


There  are  several  advantages  in  using  the  Gaussian  beam  method. 
First,  the  method  is  always  finite  at  caustics.  In  addition,  no  prior 
knowledge  of  caustic  locations  is  required  as  in  other  methods.  Next,  since 
the  Gaussian  beam  method  relies  on  local  paiabolic  approximations,  no 
global  direction  of  propagation  must  be  assumed  as  in  the  standard 
parabolic  approximation.  Finally,  the  Gaussian  beam  method  is 
comparable  in  cost  to  ray  methods,  and  possibly  faster  since  no  two-point 
ray  tracing  is  done. 

The  Gaussian  beam  result  depends  on  the  choice  of  the  initial  beam- 
width.  The  expansion  of  a  plane  wave  by  Gaussian  beams  in  a 
homogeneous  medium  requires  initial  beam-width?  comparable  or  smaller 
than  the  critical  initial  beam  width  for  a  given  station  distance.  This  is 
similar  to  an  expansion  of  an  initial  plane  wave  into  point-like  sources  as 
viewed  from  the  station.  The  approximate  expansion  of  a  line  source  into 
Gaussian  beams  in  a  homogeneous  medium  requires  initial  beam-widths 
comparable  or  larger  than  the  critical  initial  beam-width.  This  is  similar  to 
an  expansion  of  a  line  source  into  plane  waves.  The  use  of  initial  beam- 
widths  near  critical  avoids  end  effects  but  doesn't  simulate  certain  arrivals, 
such  a«  haad  wave?  -n  *.r.e  we'ried  interface  example 

The  Gaussian  beam  method  in  a  heterogeneous  medium ’has  been 
tested  using  two  approaches.  The  first  is  the  application  of  the  reciprocal 
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theorem  where  the  discrepancy  between  seismograms  calculated  for 
reciprocal  cases  is  considered  as  a  measure  of  error.  The  second  is  the 
application  of  the  Gaussian  beam  method  to  cases,  including  the  2-D  basin 
problem,  which  have  solutions  from  other  approximate  methods. 
Preliminary  results  for  a  single  heterogeneity  using  the  finite  difference 
solution  to  the  parabolic  equation  and  the  Gaussian  beam  method  gave 
best  comparisons  for  critical  beam_widths  smaller  than  the  heterogeneity 
scale. 

The  Gaussian  beam  method  was  then  applied  to  the  study  of  volcanic, 
earthquakes  under  Mt.  St.  Helens.  The  observed  differences  in  amplitude 
and  arrival  time  between  a  station  inside  the  crater  and  those  at  the  flank 
can  be  explained  by  the  combined  effects  of  an  anomalous  velocity- 
structure  and  a  shallow  focal  depth.  The  method  was  also  applied  to  study 
the  influence  of  small*scale  velocity  fluctuations  on  teleseismic  P 
waveforms  Significant  amplitude  and  phase  fluctuations  due  to  a 
lithosphere  with  randomly  fluctuating  velocities  were  found,  in  agreement 
with  observations  by  Aki  [1973]  at  the  Montana  LASA. 


|  . 
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2.2.5  APPENDIX  1 


In  this  appendix,  we  give  a  derivation  of  the  parabolic  equation  from 
the  eikonal  equation  [see  Gioge  and  Marcuse.  1969;  and  Marcuse,  1982].  By 
considering  a  variable  endpoint  in  Fermat’s  integral,  the  eikonal  or 
Hamilton-Jacobi  equation  can  be  obtained 

|  Vrj2  =  u"2  (At.  1) 


The  rays  are  the  characteristics  of  this  equation.  In  ray-centered 

coordinates,  the  eikonal  equation  can  be  written 

2L_lLfi_„a(|l.)S]i/8=o  (A1.2) 

os  v  on 

where  h. t  is  the  scale  factor  in  (2).  The  Hamiltonian  is  written  as 

ff(n.Pn)=-^4l-vWi1|  (At -3) 

and  p-  =  is  the  slowness  in  the  n.  direction  and  is  identified  as  the 
dn 

generalized  momentum.  The  Hamiltonian  can  be  approximated  to  second 
order  about  the  central  ray  [see  Cerveny.  1981]  as; 

H(n.pn)  =  -i-  +  |-vpn2  +  (A1.4) 


Following  Marcuse  [1982]  a  parabolic  equation  can  be  derived  by  treating 


all  variables  as  epe rulers  i;*.  analog*  - C'.ar.wum  me^haniwu 


a  „  .  a 
n9n-  '  l*a7 


(Al  5) 
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bcsn. v  > 7-j.t xrxvirry rirvmiy* 


cvvxj/v  vh  tv  c*  yv  ivmwa-j  lvuxvvyv  w  w  uv  v«vw  w:  w 


where  *  la  a  parameter  to  be  determined.  Applying  the  operator  H  to  a 
wave  [unction.  V' . 


(A  1.6) 


Using  equations  (A  1.4)  and  (A  1.5),  this  is  written 


(A1.7) 


/•da  x 

Writing  V'  in  terms  of  the  envelope,  ,  and  assuming 


u=cj1/2n  where  i/=0(l)  ,  gives 


2  i 


/cuv(s) 

Letting  «=cj-1  gives 


a^u^s)3 


+  ( 


it^CJvCs)2  ACU(s) 


4 4^.,  +  =0  (A1.9) 

This  is  equivalent  to  the  parabolic  equation  in  (7)  derived  from  asymptotic 
analysis.  Note  that  in  quantum  mechanics  where  h  is  Planck's 

constant,  but  here  k=v0 v0  can  be  avoided  here  by  writing  the 

2tt 

eikonal  equation  (Al  2)  in  terms  of  r'=vQr.  Classical  ray  theory  is  valid 
when  A0  is  much  smaller  than  the  heterogeneity  scale.  The  natural  analogy 
between  mechanics  und  ray  theory  .l.us  provides  a  i*iiK  between  the 
eikonal  equation  and  the  parabolic  equation.  Further  results  from  this 
analogy  can  be  found  tn  Marcuse  [1932]  and  Marcuvitz  [I9o0j. 
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2.2.6  APPEWIX  2 


In  this  appendix,  a  brief  review  of  the  properties  of  a  Gaussian  beam  in 
a  homogeneous  medium  is  given.  From  the  dynamic  ray  tracing  equations. 
(10),  and  the  initial  conditions  for  a  plane  wave  and  line  source,  one 


obtains 


<7i(s)  =  l  Pi(s)=0 
<72(s)=s-s0  7>2(s';  =  L/v0 


(A2.L) 


For  the  source  at  s0=0,  with  the  beam  waist  at  the  source,  50=0,  and  using 
(16).  the  complex  spreading  can  be  written 


q(,)  =  s  -i^Li 


(A2.2) 


where  Ly  is  the  initial  beam-width  at  the  beam  ‘.'-.ist.  The  beam-width  can 


be  written  from  (l7)  as 


£(*)=■ 

nLj& 


(A2.3) 


The  phase  front  curvature  can  be  written  from  (18)  as 


K(s)  = 


(A2.4) 


At  the  beam  waist.  LKs)-Ly  and  K[s)=  0.C  Thus  at  the  beam  waist. 


minimum 
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the  angular  spread  or  tne  far  field  diffraction  angle  can  be  defined  as 
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\ 

"  Lu 


This  leads  to  the  uncertainty  relation 

vLu  =  n 


(A2.5) 


For  a  small  Lu  there  is  a  large  far-Heid  spread,  whereas  for  a  large  Lu, 
there  is  a  small  far-field  spread.  The  distance  s^  at  which  the  beam-width 
starts  to  significantly  differ  from  the  initial  beam-width  is 


Lu  nL& 

<p  *  -  sc  =  nr- 

Sc  A 


(A2.0) 


This  is  similar  to  the  Fraunhofer  range  for  an  aperture  of  radius  a  which 

2  i 

has  the  forms»4-  For  \s  |  »sc,  ,  which  is  the  curvature  of  the 

A  S 


wavefront  for  a  line  source  Extrema  in  the  curvature  occur  at  s  ,  =sc 


The  critical  value  of  Lu  is  the  one  which  gives  the  smallest  beam-width, 
L(s),  at  the  receiver  distance,  s.  In  a  homogeneous  medium,  this  places 
the  receiver  distance  ats=s<«  Thus  fromfA2.6).  Luopt  =v^s / 77 • 


2.2.7  APPENDIX  3 


In  this  appendix,  the  weight  function  $(<5)  for  a  2-D  point  force  in  an 
elastic  medium  is  obtamed  by  comparing  the  Gaussian  beam  synthesis  with 
the  exact  plane-wave  decomposition.  For  a  general  point  force  at  the 
origin  directed  in  the  x3  direction,  the  displacement  field  for  an  isotropic, 
homogeneous  medium  is  given  by 

.  r 

IA  =  -£-(  (37*73  “  +  ^r"3  ”  ^<73  “  <5i3)(^)_1]e  ^(A3.l) 

“  ~r[  (37^73  "<5,3 )(t—  +  -l-Jr"3  -7i73(a2r)“l]«iUa 

[Aki  and  Richards.  1980],  where  a  time  dependence  of  e~*ut  is  assumed, 
and  the  direction  cosines  of  t*  are  given  by  7*=xl/ r  .  In  order  to  express 
the  above  displacement  as  an  integral  superposition  of  plane  waves,  the 
Weyl  integral  is  used  to  obtain 


fkx2*'*'*k*'*k'*'dJcxdk2 
ax3  dir,  1  r  J  2tt  U  '  12 


(A3. 2) 
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where  fcc  =  + 


.  and  c 


is  either  a  or  0  .  Then,  for  the  3-D 


point  force 


(A3. 3) 


Bnzpur  J-mJ  kpfi* 

Now  we  will  investigate  the  "P-wave"  integral  alone,  and  integrate  with 
respect  to  xg  to  obtain  the  line  force  solution.  Thus 


where  it  has  been  assumed  that  $(<5)  has  the  form.  <t(<5)-^'(d) V OqPo^o 
For  a  homogeneous  medium. 


^ob'+e]. 


where  c=S0-i-^—Lfi  .  To  compare  with  the  plane  wave  solution,  let  Lfj  -*  « 

CVq 

and  let  <5  =  i?  .  Then 


(A3. 7) 


(A3. 8) 


where  $(t?)  is  the  approximate  weight  for  the  Gaussian  beam 
representation  of  P-wave  radiation  from, a  2-D  point  force,  and  is  the 
angle  between  the  direction  of  the  point  force  and  the  beam  direction. 
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2.3  TOPICS  IN  GAUSSIAN  BEAM  SYNTHESIS 
2.3.1  INTRODUCTION 

Many  decompositions  of  waveflelds  are  possible  in  terms  of  local, 
global,  or  intermediate  building  blocks.  These  building  blocks  include  rays, 
inodes,  and  beams  In  this  paper  we  investigate  several  topics  in  the 
approximate  decomposition  of  waveflelds  in  terms  of  beams.  These  topics 
include  the  specification  of  the  beam  parameter,  c  .  and  its  interpretation 
in  terms  of  a  complex  source  point.  Asymptotic  and  exact  evaluations  of 
the  Gaussian  beam  integral  for  a  line  source  are  then  investigated.  Then, 
the  problem  connected  with  the  decomposition  of  a  point  source  in  a 
velocity  gradient  is  discussed,  and  reduces  to  the  difference  between  an 
initial  plane  wave  and  an  initial  Snell  wave.  A  comparison  with  the  PP 
phase  in  the  Imperial  Valley  structure  is  then  made.  Next,  a  slant  stack  of 
a  wavefleld  is  computed  and  compared  with  the  beam  decomposition  for  a 
large  beam  width,  showing  good  agreement.  A  first  order  correction  to  the 
plane  wave  reflection  coefficient  typically  used  at  interfaces  in  the 
Gaussian  beam  method  is  next  described.  Finally,  the  finite  frequency 
scattering  effects  from  a  dented  interface  are  computed  using  the 
Gaussian  beam  method  and  compared  with  the  effects  computed  using  the 


Aki-Larner  method. 
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2.3.2  Choice  of  the  complex  parameter  e 

Each  Gaussian  beam  in  the  Gaussian  beam  superposition  requires  the 
choice  of  a  complex  parameter,  c.  To  be  consistent  within  the 
superposition,  one  value  of  e  is  usually  chosen  for  all  beams  in  the  sum 
The  real  part  of  c  shifts  the  beam  waist  away  from  the  start  point  on  the 
ray.  The  imaginary  part  of  e  specifies  the  beam  width  at  the  beam  waist. 
The  appropriate  range  of  values  for  c  was  shown  by  Nowack  and  Aki  [1984] 
to  depend  on  the  type  of  initial  wavefleld  to  be  approximated  by  Gaussian 
beams.  When  decomposing  an  initial  plane  wave  into  Gaussian  beams, 
useful  results  with  a  finite  aperture  occur  for  small  \c\  In  the  limit  as 
'e!  -*  6,  the  Gaussian  beam  superposition  reduces  to  a  KirchhofT  integral 
with  a  paraxially  approximated  Green’s  function  When  decomposing  a 
point  source  into  Gaussian  beams,  useful  results  occur  for  large  jci  (see 
N'owack  and  Aki  [1984],  example  C)  In  the  limit  as  ]ci  -*«  .  the  Gaussian 
beam  superposition  reduces  to  a  plane  wave  decomposition  or  a  Wevl 
integral  for  a  point  or  Line  source  A  finite  value  of  'e  in  the  above  cases 
has  the  result  of  smoothing  over  diffraction  effects  (see  N'owack  and  Aki 
[1984],  example  B  for  the  diffraction  effects  of  a  half  plane  and  example  D 
for  the  generation  of  a  head  wave).  A  finite  i c i  compresses  the  energy  to 
a  vicinity  of  the  stationary  points.  The  choice  of  e  depends  on  what  part 
of  the  wavefleld  one  is  interested  in  modelling.  The  critical  value  of  Im(c) 
was  used  by  Nowack  acid  Aki  [1984]  (or  the  optimal  value  in  the 
terminology  of  Cerveny  et.  al.  [1982])  as  a  soft  cutoff  between  large  and 
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small  values  of  |c|  for  a  given  source  and  receiver  geometry.  The  above 
results  are  summarized  in  Figure  1.  Nowack  and  Aki  [1904]  also  found  in 
their  Example  G  that  the  beam  width  should  be  smaller  than  the 
heterogeneity  scale.  Moving  the  beam  waist  by  changing  the  real  part  of  c 
may  be  useful  particularly  when  a  localized  heterogeneity  exists.  The 
beam  waist  can  then  be  moved  to  the  region  of  the  heterogeneity.  For 
heterogeneities  distributed  along  the  whole  path,  the  beam  waist  is 
typically  put  at  the  source  or  the  receiver. 

2.3.3  THE  COMPLEX  SOURCE  POINT  APPROACH  TO  GAUSSIAN  BEAMS 

In  this  section.  Gaussian  beams  resulting  from  a  complex  source 
coordinate  is  described  The  complex  parameter  c,  for  each  Gaussian 
beam  in  the  Gaussian  beam  synthesis  can  be  considered  as  a  complex 
distance  from  a  source  point  in  complex  space  to  a  start  point  along  the 
ray  in  real  space.  Thus 

c  =  d.  -  ib  ( l) 

where  d.  is  the  real  distance  between  the  source  point,  or  the  beam  waist, 
and  the  start  point  and  corresponds  to  Sq  in  the  notation  of  Cerveny  et. 
al  [1904].  When  c  is  real,  it  corresponds  to  the  radius  cf  curvature  of  the 
wavefront  at  the  start  point.  For  a  complex  e,  d  can  be  used  to  shift  the 
beam  waist  along  the  ray  Putting  the  source  point  in  complex  space  by 


Figure  1  The  range  of  the  complex  parameter,  c  .  for  an  individual 
Gaussian  beam  and  their  useful  range  in  beam  synthesis. 
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specifying  6  *  0  results  In  an  evanescent  wavefleld  which  attenuates  along 
the  wavefront. 

The  complex  source  point  approach  to  constructing  evanescent 
waveflelds,  in  particular  Caussian  beams,  hast  been  described  by  Keller  and 
Streifer  [1969],  Deschamps  [1971],  Shin  and  Felsen  [1977],  and  Wu  [1905] 
Here,  the  2-D  Helmholtz  equation  is  considered  where  the  Green's  function 
is  specified. from 

(  V2  +  *2)  G(r.r')  =  -<5(r,r') 
and 

G(r,r)  =  i/  4  H^(kD) 

where  D  -  {  [x  -  x  )2  +  (z  -  z')2  )x'2  For  kD  »  l 

G(r,r)  »  1/4  (  2/rrkD  * ‘"/4  (2) 

The  Green's  function  is  now  analytically  extended  by  using  a  complex 
source  point.  Thus  r  -*  f  This  can  be  done  on  either  the  exact  or  an 
approximate  Green's  function.  For  example,  let  r  *  r  =  (0.0  +  ’6)  .  then 

D  •*  D  -  [  x2  +  (z  - 16 )2  ]l/2  Re  D  >  0 


In  the  paraxial  region  of  the  z  axis 


or 


D  *3  (z  -  ib ) 


l  +  1/2 


.5  =  z  -ib  + 


Z  X* 


2  (z2  +  62) 


i  b  x2 

2(x2  +  62) 
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The  analytically  extended  Green’s  function  in  the  paraxial  region  is  then 


ikwx*  -kb*1 


(3) 


This  is  a  Gaussian  beam  where  C  is  a  normalizing  constant.  The  beam 
curvature  is 


and  the  beam  width  is 


K{z) 


1 

z(  l  +  b2/z2) 


L(z)  =  {2b/ k)w2  (l  +  z2/  b2)l/2 


Comparing  with  equations  (A2.3)  and  (A2.4)  of  Nowack  and  Aki  [1904]  gives 


where  Lu  is  the  initial  beam  width  at  the  beam  waist  and  Lq  is  a 
normalized  initial  beam  width  independent  of  frequency  used  by  NowacK 
and  Aki  [1904],  The  complex  parameter  uan  then  be  written  in  two  ways 


€  -  Sq  -  i-z-  L&  =  d  -  ib 

& 

where  in  the  second  case  c  refers  to  the  complex  distance  from  the 
source  to  the  start  point  on  the  ray. 


The  fact  that  an  analytically  extended  Green’s  function  is  a  Gaussian 
beam  has  been  used  by  a  number  of  authors  fsee  for  example  Felsen 
[1976-.  Ra  et.  al.  fl97l'.  Pott  and  Harris  [1904])  to  mode!  the  effects  of 
beam  scattering  The  exact  or  approximate  scattering  results  for  a  point 
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source  is  found  and  then  expended  to  a  complex  source  point  to  give  the 
scattering  results  for  a  beam.  Thus  the  scattering  results  for  certain  types 
of  extended  sources,  Gaussian  beams,  can  be  gotten  from  the  analogous 
point  source  problem  without  performing  any  integration.  In  addition  the 
plane  wave  response  can  be  gotten  from  the  point  source  response  by 
letting  6  -*  «  . 

In  the  Gaussian  beam  synthesis  method,  a  quite  different  approach  is 
taken  where  approximate  Gaussian  beams  are  traced  through  the  medium 
and  superposed  to  simulate  the  scattering  effects  for  a  point  source  or  an 
arbitrary  extended  source.  The  complex  source  point  is  expected  to  be 
useful  in  evaluating  how  appropriate  the  approximate  Gaussian  beams  are 
in  different  complicated  geologic  media. 


2.3.4  THE  USE  OF  COMPLEX  PHASE 

Introducing  a  finite  imaginary  part  to  c  results  in  a  complex  phase  for 
the  component  beams  in  the  Gaussian  beam  synthesis  as  eqn.  (3)  This 
imaginary  phase  part  will  act  to  modify  the  integrand  of  the  Gaussian  beam 
synthesis.  In  order  to  represent  a  point  source  by  beam  superposition,  a 
value  of  b  loss  than  infinity  will  act  to  filter  the  plane  wave  components  in 
a  Gaussian  manner 


The  use  of  complex  phase  in  wave  superposition  integrals  has  been 
used  by  several  authors  in  order  to  obtain  various  computational 
advantages.  In  the  discrete  wavenumber  method  (see  Larner  [1970], 
Bouchon  and  Aki  [1977])  a  complex  frequency  is  used  to  reduce  the  effects 
of  artificially  induced  periodic  sources  resulting  from  discretizing  the 
wavenumber  integral.  Thus 

-*  e1Uft 

This  results  in  an  exponentially  damped  time  domain  trace  and  smoothing 
and  broadening  in  the  frequency  domain  since 

a,2  +  '-i2 

are  a  Fourier  transform  pair.  Since  a  =  vk  ,  addmg  a  small  complex  part 
to  the  frequency  and  requiring  v  to  be  real,  adds  a  complex  part  to  the 
wavenumber,  k  ,  also.  This  has  the  effect  of  moving  singularities  off  the 
real  axis.  Thus,  adding  a  small  complex  part  to  the  frequency  in  this  case 
can  be  used  to  gain  numerical  stability,  economize  on  computation  time, 
and  reduce  effects  of  artificial  periodic  sources  when  sampling  the 
wavenumber  integral.  The  integral  is  slightly  distorted,  but  this  is 
corrected  after  the  synthesis  by  multiplying  by  eXUit 

An  interesting  twist  on  this  strategy  was  given  recently  by  Madariaga 
and  Papadimitnou  [1985]  to  numerically  compute  the  Gaussian  beam 
impulse  response  having  singularities  at  the  geometric  arrivals.  A  small 
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complex  part  is  added  to  the  phase  time  on  the  order  of  the  time  spacing 
of  the  impulse  response  to  be  computed.  Thus 

tJ  -*  0  +  i£it 

The  result  of  this  is  to  Alter  the  high  frequencies  and  smooth  the  transient 
signal  by  convolving  with  a  function  which  has  a  width  on  the  order  of  the 
sample  spacing  Al  .since 

e  *j  j  I  ai  t  r  2A* 

i2  +  4t*  . 

are  also  a  Fourier  transform  pair.  Thus  singularities  at  geometric  arrivals 
are  slightly  smoothed  to  give  a  stable  sampled  impulse  response  without 
the  need  to  use  an  explicit  source  time  function  such  as  the  Gabor  wavelet 
as  was  used  by  Nowack  and  Aki  [1984]  A  particular  source  time  function 
can  then  be  incorporated  in  a  convenient  manner  a’,  a  later  point. 


2.3.5  Asymptotic  and  exact  solution  of  the  Gaussian  beam  synthesis  for  a 
line  source 

In  this  section,  the  asymptotic  and  exact  solutions  to  the  Gaussian 
beam  superposition  for  a  line  source  are  given.  The  geometry  for  a  line 
source  at  the  origin  and  a  receiver  at  (z,z)  is  shown  in  figure  (2a).  where 
Rz  =  (x2  +  z  •)  =  (a 2  +  a2)  and 

s  =  /?cos(?r  -  =  z  cos<p  +  xsin<p  =  v  (xpz  +  zp,) 


n  =  Rsin(<pr  ~<p)  =  XC0SV  “  zsin<P  =  v  (*P»  “  ZP*) 
pg  and  p ,  are  the  horizontal  and  vertical  ray  parameters  where 

pg  =  sinp/v  pt  =  cosp/ v 

The  wavefleld  generated  by  a  line  source  can  be  represented  in  terms  of 
the  2-D  Weyl  integral  as 

u(z,*.b)  =  (5a) 

1-  P* 

where  Im^r  &U  and  z  3?  U  .  Changing  the  integration  variable  to  angle, 
(p  ,  one  obtains 

u(x,z,u)  =i/4n  +c°«*)d<p  (5b) 

L 


where  the  contour  L  goes  from  -n/2-i«  to  +7T/2+i°°.  The  time 
domain  solution  due  to  a  line  source  is  given  by 


u{x,z,t)  =  1/  2rr 


Hit  -R/v' 


'/tt-RZ/v2 


The  Gaussian  beam  synthesis  m  terms  of  angle  can  be  written 


+n/2f 

u^{x.z,d)  -  i/4n  f 

-!T/2[  *■  T 


li/2  lu/ 


'l*  *  2(c^s) 


where  c  is  the  complex  beam  parameter  described  in  a  previous  section. 
Usually  only  homogeneous  central  rays  are  considered.  As  c  -  *  and 
extending  the  contour  to  include  the  inhomogeneous  waves,  then  this 
reduces  to  the  exact  solution  given  in  (5b). 


The  steepest  descent  method  is  now  applied  to  (6)  in  order  to  obtain 
the  asymptotic  solution  for  large  a,  following  Cerveny  et.  al.  [1982]. 
Equation  (8)  can  be  written  in  the  form 

/( cj)  =  f  F(<p)  dtp  for  o  -* «» 
c 

where 


This  can  be  written  as 


f(<p)=i/v\  flcosAy  +  ,  “  Sp  \--r 

J  [  2 (c  +  Rcosbp) 

where  =  '-pr  "  V  •  The  saddle  point  is  defined  by  /’(<?)  =  0  ,  where 


.  t  D  .  A .  ,  R  cosAg  Rz  sin2Aig 

^  ”  v  '  (  c  +  flcosA?  )  2 (e  +  flcosAv)2 


A  saddle  point  occurs  at  y  ■=  <pr  ,  the  same  as  for  the  exact  line  source 
representation.  There  is  an  additional  saddle  point  coming  from  equating 
the  bracket  to  zero  which  will  be  discussed  subsequently.  At  the  saddle 
point,  <p  -  ,  in  the  Gaussian  beam  case 

v,  _  -+.tR 

/  W  J-*  "  v(c  +  fl) 

The  exponent  can  then  be  approximately  written  as 


f{(fi)  «/(*r)  +  l/2/”(^)l^f  V 


/(„)..  i*/*- toi'  +  R)  <*’-*? 
Let  {(fir  -  (fi)z  =  4  «iW  then 


~  ,  *(M  +  »r/ 2  +  ^(7-^)) 

/(*)"</?/«  -R/2v\-±-\A  e  '  +  * 


fifi)  «x/?/v  -R/2v\  'i^[eos<8  +  x  sin/S 


where  B  =  2i?  +  rr/2  +  .4/?G  — =-=■  .  Now  choose  i?  such  that  sin/S  =  0 

€  +  /? 


which  gives  the  constant  phase  or  steepest  contour.  Thus  /S=titt.  This 


results  in  iJ  =  -  4-.4/?C  — £-=■  for  steepest  descent.  The  integral  can 

4  2  c  +  R 


then  be  approximated  as  (see  for  example  Aki  and  Richards  [1980],  box 


/(u)  =  1  .  '  utF{ Vr)*'* 

Uf  {(fir) 


{ x,z,u )  «  -i-  i  1/2  etuR/v  "n/A  •*  f  (ufl/v) 
4?r  urt  A 


which  is  also  the  asymptotic  expansion  for  the  exact,  solution.  Thus  for 
large  y,  the  Gaussian  beam  synthesis  for  the  saddle  point  fi  =  fir  gives  the 


correct  response.  This  will  be  invalid  when  c  =  0  for  the  Gaussian  beam 


synthesis  since  then  f"{<fir)  =  0  ,  In  order  to  investigate  this  behavior,  an 
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exact  transient  evaluation  of  the  Gaussian  beam  integral  is  done. 


The  Cagniard  method  is  first  used  to  evaluate  the  exact  transient 
response  (5c)  from  (5a).  The  substitution  J  =  -i«  is  made  In  (5a)  where 
y  is  real  and  positive.  Equation  (5a)  can  then  be  written 

=i/4n  /,-«****«>  *E2- 
-L  Pm 

This  allows  the  branch  cuts  to  be  changed  to  those  in  figure  (2b)  since  now 
R ept  aO  .  Now  let  t  ~pzx  +  pfz  .  and  deform  the  contour  in  the 
complex  p-plane  so  that  t  is  reaL  Thus 

.u(x.z.ff)  =t/47T /«”** 

Jc  Pt 

The  Cagniard  contour  CM  and  C-  is  shown  in  figure  (2b)  where 


t  >  R/  v 


According  to  Jordan's  Lemma  the  contributions  on  the  large  arcs  in  figure 
(2b)  go  to  zero  leaving  the  result 


u(x,z,f)  =  i/ 47r 


t/  47T 


u(x,z ,?)  = -i/ 4rr  f  - -  dt  +  i/  4 rr  f  - — 

fl/v  P*+  [  .  I  R/v  Pt- 


*Pi- 

dt 


where 
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dP»(v  -) 


dt 


=  ^^/-)i-^L\t2-^/^ 


-1/2 


and 


dP»(+/  -) 


df 


1 


*»<+/-) 


(+/  -H 


t2-*2/*2 


T7T 


Then 


u(*,z,?)  =  l/2n  /*  7 - - - 

«/v  It2  -&/v2 


i/z 


i“*  di 


Using  the  fact  that  L 


-l 


Je^df 


=  f(t)  H(t  —  i j)  .  the  exact  time 


domain  response  is  given  by  (5c). 


The  Cagniard  method  is  now  used  to  obtain  tr.e  transient  response  of 
the  Gaussian  beam  integral  in  (6).  First,  the  Gaussian  beam  integral  is 
converted  to  px  and  extended  from  to  +«  to  include  the  full  range 
of  ray  parameter  The  substitution  s  =  -i«  is  then  made,  where  ?  is  real 
and  positive.  Thus 


ugb{x,z ,?)  =  i/47T  f 


c  +  s 


1/2 


e 


Js  ,  n» 

V  2v(c  *■  3 


dPz 

Pm 


where  the  branch  cuts  are  defined  by  Rep,  a  0  .  Now  the  substitution  is 

.2 


made 


t  - 


n 


Since 


s2  +  n2  =  R2  , 


then 


t  - 


[  v  2v  (e  +  s ' 

_  dZ  _  s2  I 

—  +  -2-r — =t'  We  now  want  to  deform  the  contour  in  the  complex 
v  2v[e+s.( 


f 
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r  -^Jb  "  l  dps-  ds  ji 

+  t/4rr  /  — ~T  - - TT"  ^77 

r?v  +  s  P«-  “  l“‘ 


I  rfPz(v-)  _  (+/-K 

where  fcr  ~r\-  [.i-M- 


T  11  */« 

—  4s-  =  (Z?2  +  ft2)  >  Vztz 

-ib  +sjd<J  X 

'  11/2 

_d$_*L.=  s-vca-'^+w  v2t2  >  (/?*  +  6s) 

-ib  +  s  j  di  j  ^ 


[s8  -  «8]'I/S  =  [s  (tf  +  ivt)(b  -  (K*  +  b*)>  vzt* 

[s8  -  ff8|’1/2  =  [Z  (X  +  +  X)\'/2  v8t8  >(/!*  +  6C) 

The  final  response  is 

’[  -»*  ll/a  ds  U  TX1/al.  mt  -R/v)-*  t 
l-^+sj  dt  l  )  8trf(a  . 

where  )(  is  a  modification  factor  to  the  exact  solution  when  using  the 
Gaussian  beam  superposition,  where 

y'F  (v8i8  -  #)''*_  (S8  +  6«>„st8  . 

*  A-  [2  (6  -  ^)j1/8 

x  =  (u2(a  ~  *8t8>(/?8  +  <>8) 

X2  (x*  +  .?) 


This  modification  factor  is  shown  in  figure  (le).  The  modification  factor 
has  a  singularity  at  u2f2  =  (Z?2  +  b2)  After  this  time  the  modification 
factor  becomes  complex,  but  is  bounded.  This  additional  singularity  moves 
to  infinity  as  b  -*  «  To  check  the  existence  of  this  singularity,  we  note 
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that  the  function  /'(?)  =  0  given  previously  has  an  additional  zero  when 
the  bracket  term  is  zero,  giving  s  =  flcos <p  =  ib  (+/  -)  (B2  +  62)1'2  ,  For 
the  plus  sign  and  using  the  formula  for  s(f)  ,  this  corresponds  to 
t  =  +  z/v  ,  which  is  what  is  shown  here.  For  a  critical  value  of 

6  ,  this  secondary  singularity  occurs  at  1.41  times  the  geometric  arrival. 
In  the  p-plane  shown  in  figure  (2d)  this  corresponds  to  the  two  points,  A ^ 
and  Az  ■  The  problem  of  this  secondary  singularity  occurs  when  extending 
the  original  contour  form  to  +«  to  include. the  evanescent  beam 
waves.  It  would  be  difficult  to  define  central  rays  for  these.  The  usual 
Gaussian  beam  formulation  only  includes  central  rays  with  real  takeoff 
angles,  and  the  additional  singularity  won’t  in  general  be  included  in  the 
usual  Gaussian  beam  synthesis.  The  usual  Gaussian  beam  integral  thus 
does  not  correspond  to  the  exact  transient  soluucn  for  a  line  source.  It 
approaches  the  exact  solution  as  c  -*<*>  .  For  large  a  and  real  takeoff 
angles,  the  Gaussian  beam  integral  is  asymptotically  equivalent  to  the 
exact  integral  representation  as  long  as  c  *■  0  In  the  general  case,  the 
Gaussian  beam  response  is  good  near  the  geometric  arrival  but 
deteriorates  for  larger  times. 


2.3.6  Initial  point  in  a  medium  with  a  velocity  gradient 


In  this  section,  the  initial  conditions  for  the  variables  p(s)  and  q(s) 
that  solve  the  dynamic  ray  equations  are  discussed.  The  variable  q(s ) 

has  the  interpretation  of  a  spreading  and  ^•■2-  has  the  interpretation  of  a 
wavefront  curvature.  Gaussian  beams  can  be  formed  as  a  linear 


combination  of  two  independent  sets  of  solutions  to  the  dynamic  ray 
equations  with  a  complex  weighting  factor.  Thus 


q{s)  =  qz(s)  +  e  q  i(s) 
p(s)  ~ p2(s )  +  epj(s) 


and  e  =  d.  -  ib  is  described  in  an  earlier  section.  With  b  strictly  greater 
than  zero,  then  a  beam  is  defined  in  which  q(s)  *  3  .  This  uses  the  fact 

that  the  deterrr’nant  of  the  matrix  formed  by  the  two  solutions. 

is  an  invariant  of  the  motion,  and  is  sometimes  called  the  Lagrange 
invariant.  Cerveny  et.  al.  [1982]  uses  two  independent  solutions  with  initial 

conditions.  £  1/°Vq 

correspond  to  an  initial  planar  wavefront  with,  P\/q\  =  0.  The  initial 
conditions,  <72(°)=0  and  P2(°)  =  1/'uo  correspond  to  an  initial  point 
source,  with  Pz/  qz  -*30  The  source  region  is  usually  assumed  to  be  locaily 
homogeneous. 


The  initial  conditions,  g j (0)  =  1  and  pt(0|  =  0  , 


<7 1  <72 
Pi  P  2j’ 
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Madariaga  [1984]  constructed  a  beam  solution  in  terms  of  a  small 
perturbation  of  a  constant  Snell  wave  having  a  constant  horizontal  ray 
parameter.  For  a  source  imbedded  in  a  vertically  varying  medium,  he 
found  that  the  (gi.Pi)  solution  must  be  modified  in  order  to  specify  a 
constant  Snell  wave,  thus 


<7i(0)  =  1 


Pi(0) 


-Pz2^,»(0) 

COSljPo 


(0) 


where  (pQ  is  the  initial  takeoff  angle  of  the  ray  from  the  vertical  ar.d 
Pj  =  sin^o/ vo  is  the  horizontal  ray  parameter  which  in  a  vertically 
varying  medium  is  constant  along  a  ray.  The  (q2>Pz)  solution  is  the  same 
in  eaoh  case.  For  a  homogeneous  source  region,  where  u,(0)=0, 
Madariaga's  constant  Snell  wave  solution  is  the  same  as  the  previous  plane 
wave  solution.  When  £-*«>,  Madariaga's  beam  superposition  in  a 
vertically  varying  media  reduces  to  the  standard  WKBJ  result. 


Madariaga’s  initial  conditions  in  a  vertically  varying  medium  is  shown 
by  imbedding  a  thin  homogeneous  layer  within  the  gradient  as  in  figure 
(3a).  A  source  is  put  in  the  constant  velocity  layer.  Boundary  conditions 
are  then  matched  on  the  top  and  bottom  interfaces  and  the  layer 
thickness  shrunk  to  zero.  Boundary  conditions  for  p  and  q  at  an 
interface  are  given  by  Cerveny  and  Hron  [1979]  derived  using  the  phase 
matching  principle.  The  2-D  results  are  given  in  a  compact  form  by  Cerveny 
[1981]  as 
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l  0 
c  +  In  COS2T? 

cos2t? 

where  (q ,p)  are  the  incident  values  and  (gf,p)  are  the  transmitted  or 
reflected  values.  The  upper  sign  is  for  the  transmitted  case  and  the  lower 
sign  for  the  reflected  case.  The  angles  are  defined  in  figure  (3b).  The 
effects  due  to  interface  curvature,  represented  by  C  ,  and  the  effects  due 
to  velocity  heterogeneity,  represented  by  In  ,  are  additive,  where 


COST?  f 


?  =  (+/-)  — 
?  K '  COST? 


where  G  is  the  local  interface  curvature  at  the  point  of  incidence,  where 

*♦ 

G  is  positive  if  the  interface  is  convex  as  seen  by  the  incident  ray.  L  is 
the  local  tangent  to  the  ray  and  m  is  perpendicular  to  it  as  seen  in  figure 
(3b). 


In  the  case  of  a  ray  striking  the  lower  horizontal  interface  as  in  figure 
(3b),  then 


= -U  jSim?  V  i  —  v  z  cost? 

Also,  since  v  -  v  at  the  interface,  then  t?  =  $  by  Snell's  law  The 
interface  curvature  tern?  is  zero,  C  =  0  ,  and  the  velocity  inhomogeneity 
-v  .  sin2T? 

term  is  In  -  — - -  then 

itcost? 
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7=7 


P  =P'- 


u  z  sin2T? 
v2cost3 


bin  inking  the  layer  thickness  to  zero  results  in  the  modified  initial 
conditions  for  (<7 i.p  1)  as  in  eqn  (8).  For  a  vertical  ray,  these  conditions 
reduce  to  Cerveny’s  initial  conditions.  For  a  horizontal  grazing  ray,  the 
boundary  conditions  at  the  interface  break  down. 


Part  of  the  discrepency  in  the  two  initial  conditions  for  (7  ijo  1)  is  that 
an  initial  plane  wave  imbedded  in  a  medium  with  a  vertical  velocity 
gradient  is  not  a  wave  with  a  constant  horizontal  ray  parameter. 
Madariaga's  initial  conditions  for  (7i,Pi)  construct  an  initial  constant 
Snell  wave.  A  constant  Snell  wave  has  a  caustic  or  focus  at  its  bottoming 
point.  This  results  in  a  problem  when  using  Madariaga’s  beam 
superpo.  ition  for  an  initial  ray  that  is  exactly  horizontal  for  which  both 
the  (g2, p2)  solution  and  the  ( 7 1  .p  1 )  solution  are  singular  and  thus  no 
longer  independent.  In  the  WKBJ  method,  the  contributions  from 
horizontal  rays  are  typically  windowed  out  (see  Sinton  and  Frazier  [  1 982 


By  using  Cerveny’s  initial  conditions,  two  independent  solutions  are 
always  guarranteed.  The  disadvantage  of  this  approach  is  that  the  (gj.jDp 
may  no  longer  correspond  to  a  constant  Snell  wave  in  the  global 
coordinates.  Also,  since  the  weight  function  in  the  beam  superposition  is 
derived  by  matching  to  a  known  solution  in  a  homogeneous  source  region, 
then  in  an  arbitrary  source  region,  the  weight  function  will  only  be 
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approximate.  Beydoun  and  ben  Menahem  [1985]  used  Cerveny’s  initial 
conditions  in  various  gradients  and  compared  with  exact  analytic 
solutions.  They  showed  that  good  aggreement  occurred  provided  that 
several  high  frequency  conditions  were  satisfied.  In  addition,  the  Gaussian 
beam  superposition  was  found  to  have  more  restrictions  than  the  ray 
theoretical  solution.  This  may  be  in  part  due  to  the  approximate  weight 
function  used  in  the  Gaussian  beam  superposition. 

Several  examples  are  now  done  using  Cerveny’s  initial  conditions,  C.i.c., 
and  Madariaga’s  initial  conditions,  M.i.c.,  in  a  vertical  gradient.  In  the  first 
example  a  surface  source  is  used.  The  medium  has  a  linear  gradient  with 
velocities  from  1.5  km/sec  to  5.6  km/sec  fromO  to  10  km.  A  Gabor  wavelet 
with  /  =  4/iz  and  y  =  3  was  used  (see  Nowack  and  Aki  [  L 984]  eqn.  (22)). 
Results  in  the  time-distance  domain  are  shown  in  figure  (4)  with  the  C.  i.c. 
shown  on  the  left  and  the  M.  i.c.  shown  on  the  right.  Two  initial  beam 
widths  are  used,  L0  =  5  kml/z  and  Lq  =  15  kml/z  ,  where  L0  =  5  kml/z  is 
near  an  average  critical  value.  The  C.  i.c.  for  L0  =  15  kml/z  has  noise 
pedominantly  in  front  of  the  geometric  arrival.  The  M.  i.c.  for 
Lq  -  15  kmx/z  has  noise  later  in  time  than  the  geometric  arrival. 

In  figure  (5),  the  contributions  from  each  beam  is  shown  for  a  station 
at  12  km.  The  same  model  as  in  figure  (4)  is  used.  The  rays  are  numbered 
from  l  to  20,  from  shallow  to  steeply  dipping  at  the  source.  The  ray 
number-phase  time  plots  are  shown  for  both  initial  conditions  and  initial 


VlVkliV.W  lAVIVIVlViV 


Figure  5  Ray 
gradient  w 
i.c.,  and  tw 


LINEAR  GRADIENT 


>  • 

129 


beam  widths,  Lq  =  5,  15  hmw2  .  The  resultant  seismogram  in  each  case  is 
shown  by  the  scaled  sum  on  the  right  of  each  sub-plot  in  figure  (5).  In  all 
sub-plots,  the  geometric  ray  to  the  station,  ray  number  12,  is  a  stationary 
point.  A  significant  difference  between  the  C.  i.c.  and  the  M.  i.c.  is  the 
curvature  of  the  beam  arrivals  near  the  geometric  arrival.  Using  the  C.  i.c., 
•the  beams  near  the  geometric  arrival  come  in  earlier,  while  using  the  M. 
i.c.,  the  beams  near  the  geometric  arrival  come  in  later  than  the  geometric 
arrival.  This  can  be  understood  by  noting  that  using  the  C.  i.c.  no  qx  or 
<72  caustics  occur  along  the  ray.  Thus  the  curvature  of  the  beams  should 
be  in  the  same  sense  as  in  a  homogeneous  medium,  in  which  the 
surrounding  beams  come  in  earlier  than  the  geometric  arrival  (see  Nowack 
and  Aki  [1984]  figure  (5b)).  Several  later  arrivals  are  seen  for  C.  i.c  . 
Lq  =  15  kml/z  ,  but  this  is  due  to  an  artificial  q{  caustic  caused  by 
projecting  these  rays  above  the  free  surface.  Using  the  M.  i.c.,  all  beams 
have  passed  through  a  caustic  at  the  bottoming  point  of  each  ray 
which  changes  the  curvature  of  the  beams.  The  C.  i.c.  has  no  bottoming 
point  caustic. 

Another  difference  between  the  two  initial  conditions,  C.  i.c.  and  M.  i.c.. 
is  the  phase  of  the  individual  beam  wavelets.  There  is  a  phase  delay  for  the 
C.  i.c.  and  a  phase  advance  for  the  M.  i.c..  In  each  case,  the  phase  of  the 
individual  beam  wavelets  depends  on  the  initial  beam  width  used.  In  all 
cases,  the  resulting  sum  is  approximately  zero  phase.  Thus  in  each  case, 
the  individual  beam  phase  shifts  is  offset  by  the  curvature  of  the  beams 
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away  from  the  geometric  arrival.  This  occurs  in  such  a  way  as  to  give  a 
seismogram  with  no  resulting  phase  shift.  This  results  from  using  a  weight 
function  in  the  superposition  which  was  derived  by  matching  to  a  known 
solution. 

In  figure  (0),  the  time-distance  wavefields  are  shown  for  a  source  at  4. 
km  depth  in  a  linear  gradient.  The  velocity  is  L.8  km/sec  to  5.6  km/ sec 
from  0.  to  20.  km  in  depth.  A  Gabor  wavelet  with  /  =  4  hz  and  7  =  4  For 
the  M.  i.c.  there  is  now  a  small  second  arrival  for  distances  greater  than  16 
km,  using  both  initial  beam  widths,  4  and  20  Arm1/2  .  Also,  the  amplitude  is 
slightly  larger  in  the  14-16  km  range  for  the  M.  i.c.  case.  In  order  to 
investigate  this,  ray  number-phase  time  plots  are  constructed  for  several 
station  locations. 

In  figure  (7),  the  ray  number-phase  time  plots  are  shown  for  a  station 
range  of  22  km,  for  both  initial  conditions  and  initial  beam  widths.  The 
rays  are  numbered  from  l  to  20  from  shallow  to  steeply  dipping  at  the 
source,  as  shown  in  the  above  ray  diagram.  The  geometric  arrival  is 
specified  by  ray  number  17.  For  the  C.  i.c.,  the  beams  near  the  geometric 
beam  arrive  earlier  in  time.  The  noise  in  front  of  the  geometric  arrival  in 
the  scaled  sum  for  Lq  -  20  Arm 1/2  can  be  reduced  by  using  more  beams  m 
the  supe/position.  For  the  M.  i.c.,  the  beams  nearby  the  geometric  arrival, 
ray  number  17,  come  in  later  in  time.  But,  another  stationary  point  occurs 
at  ray  number  9.  This  is  a  maximum  stationary  point,  and  has  lower 
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Figure  6  Time  distance  wavehelds  for  a  source  at  4  km  depth  in  a  linear 
gradient  for  two  initial  conditions.  C.  i.c.  and  M.  i.c.,  and  two  initial 
beam  widths.  L0  -  4.20  km1/z  . 
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amplitudes.  It  corresponds  to  the  ray  that  takes  off  horizontally  at  the 
source.  There  is  a  separation  between  rays  that  are  upgoing  at  the  source 
and  those  that  go  through  a  bottoming  point  and  pass  through  a  qj 
caustic.  Since  the  contributions  don’t  quite  go  to  zero  at  this  secondary 
stationary  point  and  don’t  phase  cancel,  an  artifact  occurs  in  the  resulting 
sum.  This  artifact  occurs  for  both  Lo  =  4kmi/z  and  Lq  =  20  Aim 1/2  . 
Again,  in  the  WKBJ  result  these  ray  numbers  are  typically  windowed  out. 
This  would  not  be  a  usefql  remedy  if  the  geometric  arrival  is  near  this 
horizontal  ray  as  in  the  Imperial  Valley  example  to  follow. 

In  figure  (8),  a  similar  set  oi  ray  number-phase  time  plots  are  shown 
for  a  station  at  15  km  In  this  case,  the  geometric  arrival  is  at  ray  number 
13.  This  is  much  closer  to  the  horizontal  ray  at  ray  number  9.  For  the  M. 
i.c..  the  secondary  stationary  point  doesn't  cause  a  spurious  arrival,  but 
now  interferes  directly  with  the  main  pulse.  Modifying  the  initial  beam 
width  doesn't  alleviate  the  problem 

Madariaga  [1904]  considered  a  beam  superposition  as  a  small 
perturbation  of  a  WKBJ  superposition  with  respect  to  horizontal  ray 
parameter.  This  approach  loses  the  advantage  of  using  two  independent 
solutions  for  (p,q )  for  a  horizontal  ray.  Madariaga's  initial  conditions 
cause  a  spurious  phase  for  the  rays  near  horizontal  which  cant  be 
eliminated  by  changing  the  initial  beam  width.  Cerveny's  initial  conditions 
are  quaranteed  to  give  two  independent  solution  for  (p.q)  ,  but  doesn't 
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Figure  0  ray  number-phase  time  plots  for  a  source  at  4  km  depth  in 
linear  gradient  with  a  station  at  15  km.  Two  initial  conditions,  C.  i 
and  M.  i.c..  and  two  initial  beam  widths.  Lq  =  4.20  ton 1/2  are  shown. 
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correspond  to  a  small  perturbation  of  the  WKBJ  solution  in  an 
inhomogeneous  source  region.  In  addition,  since  the  weight  function  is 
derived  for  a  homogeneous  source  region,  it  will  only  be  approximate  using 
the  Cerveny  initial  conditions  in  an  inhomogeneous  region.  The  major 
difference  between  the  two  initial  conditions  is  in  defining  a  local  plane 
wave  in  the  ray  centered  coordinates  versus  defining  a  constant  Snell  wave 
in  the  global  coordinates. 

2.3.7  SEISMOGRAMS  IN  THE  IMPERIAL  VALLEY  VELOCITY  STRUCTURE 

In  this  example,  Gaussian  beam  seismograms  are  computed  using  the 
Imperial  Valley  velocity  structure.  An  interesting  feature  of  the  recorded 
strong  ground  motion  for  the  1979  Imperial  Valley  earthquake  was  the 
large  vertical  accelerations  at  some  but  not  all  sites  (see  Archuleta  [1982J. 

For  example,  at  the  station.  E06,  the  peak  acceleration  was  1705 
cm/ sec2.  Several  explanations  for  this  localized  region  of  high 
acceleration  have  been  considered,  including  the  source  rupture  process, 
and  a  path  effect  due  to  the  velocity  gradient  in  the  upper  5  km  of  the 
Imperial  Valley.  Here  we  explore  only  the  path  effect.  A  likely  candidate 
for  the  large  acceleration  resulting  from  a  path  effect  is  the  PP  phase 
which  was  a  prominent  arrival  recorded  during  the  1979  refraction  survey 
of  the  Imperial  Valley  (see  McMechan  and  Mooney  [1900],  and  Fuis  et.  al. 

[1982]).  Because  of  the  focusing  effect  due  to  the  multiple  bounce,  the  PP 
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phase  has  a  localized  high  amplitude  over  a  limited  distance  range 
Archuleta  [1982]  found  that  the  travel  times  of  the  observed  high 
accelerations  were  consistent  with  a  PP  phase  originating  from  a  source 
point  at  4.0  km  depth  and  5.5  km  north  of  the  hypocenter,  although  he  now 
attributes  the  main  cause  of  localized  high  acceleration  to  a  source  effect. 

The  Imperial  Valley  structure  derived  from  the  refraction  survey  is 
shown  in  figure  (9).  The  ray  diagram  for  the  PP  phase  is  also  shown,  in 
figure  (10),  seismograms  computed  using  the  Gaussian  beam  method  are 
compared  to  seismograms  computed  using  the  Reflectivity  method,  without 
attenuation,  by  R.  Archuleta.  The  Gaussian  beam  method  uses  the  Cerveny 
initial  conditions  described  in  the  previous  section.  A  significant  portion  of 
the  energy  comes  from  nearly  horizontal  rays  at  the  source.  The  source  ;s 
modelled  using  a  point  double  couple  radiation  pattern  with  a  dip  of  75 
degrees,  a  0.0  degree  slip  angle,  and  the  seismic  section  13  degrees  off 
strike.  A  Gabor  wavelet  with  a  frequency  of  6  hz  was  used  in  the  Gaussian 
beam  seismograms.  To  obtain  the  Gaussian  beam  result  for  a  point  source, 
the  GB  results  were  multiplied  by  a  ray  method  correction  given  by 
Cerveny  et.  al.  [1982].  The  value  of  L0  was  set  at  2.0  fern 1/2  resulting  in 
average  beam  widths  at  the  receivers  of  about  2  km.  Both  seismic  sections 
show  the  predominant  high  amplitudes  of  the  PP  phase  in  the  distance 
range  of  15-18  km.  with  the  Gaussian  beam  result  being  slightly  closer 
Significant  S  wave  energy  is  seen  in  the  Reflectivity  section.  The 
Reflectivity  section  shown  in  Archuleta  [1982]  included  the  effects  of 
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attenuation  which  diminished  the  S  waves,  and  further  enhanced  the  PP 
arrival. 

Sa0  PLANE  WAVE  DECOMPOSITION 

In  this  section,  a  simple  comparison  is  given  between  the  individual 
beam  contributions  in  the  Gaussian  beam  superposition  and  a  slant  stack 
of  a  time-distance  wavefield.  The  operation  of  slant  stacking  is 
kinematically  equivalent  to  a  plane  wave  or  r-p  decomposition.  In 
mathematical  notation  this  is  also  known  as  an  inverse  Radon  transform. 

The  plane  wave  decomposition  holds  a  central  role  in  the  velocity  » 

analysis  in  vertically  varying  media.  The  r- method  for  velocity  analysis 
was  used  for  example  by  Bessonova  et.  al.  [1974,  1976].  McMechan  and 
Ottolini  [I960]  showed  that  T-p  curves  can  be  directly  observed  in  slant 
stacked  wavefield  data.  Clayton  and  McMechan  [1981]  performed  an 
iterative  migration  to  convert  the  t-jd  wavefield  to  a  velocity  depth 
estimate.  This  was  applied  to  the  upper  mantle  structure  by  Walck  and 
Clayton  [1984]. 

Using  the  r-p  wavefield  has  the  advantage  that  triplications  are 
unwrapped  and  thus  made  more  easily  interpretible.  Also,  in  a  vertically 
varying  medium,  each  r-p  trace  corresponding  to  a  ray  parameter,  p, 
satisfies  a  1-D  wave  equation.  The  relationship  between  the  t-jd  wavefield 
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and  seismogram  synthesis  has  been  investigated  by  Chapman  [  1978.1901, 
and  Treitel  et.al.  [1902].  For  a  line  source  in  a  vertically  varying  medium 
Chapman  [1901]  gives  the  Radon  transform  formulas  for  wavefleld 
decomposition  and  synsthesis  as 

os 

=  f  u(T+px,x)dx  (9a) 

and 

m 

u(t,x)  =  f  ^-u+(f  ~PX>P)  <*P  (9b) 

where  {2+  is  the  time  derivative,  Hilbert  transform  of  u  .  These  are 
reversible  linear  transformations  which  map  a  line  in  one  domain  to  a 
point  in  the  other  domain.  Each  trace  of  tt(T,p)  represents  an  individual 
plane  wave,  or  in  a  vertically  varying  medium  a  constant  Snell  wave  since  in 
general  they  won’t  be  planar. 

An  intermediate  step  in  the  Gaussian  beam  method  is  the  evaluation  of 
each  beam  contribution  along  a  specific  ray  arriving  at  a  given  station. 
Each  beam  satisfies  a  local  wave  equation  along  each  central  ray.  The  final 
GB  solution  using  the  slowness  method  is  a  superposition  of  the  beam 
contributions.  Thus 

\/v 

u(t, x)  -  f  ugb{s,n,t)dp  t L 0) 

-\/v 

where  Ugb  is  an  individual  beam  contribution,  and  the  integral  is  over  the 
ray  number  measured  at  the  source.  In  the  Gaussian  beam  method  this 
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integral  is  usually  only  over  real  angles.  •  Using  large  initial  beam  widths  in 
a  vertically  varying  medium,  then  each  beam  will  approach  an  initial  plane 
wave  contribution  as  in  equation  (9b). 

A  simple  comparison  is  now  done  between  a  slant  stacked  wavefield 
and  the  individual  beam  contributions  in  equation  (9b).  The  velocity  model 
consists  of  a  15  km  homogeneous  layer  with  a  velocity  of  5.6  km/ sec  over  a 
linear  gradient  from  5.6  to  8.0  km/sec.  A  Gabor  wavelet  is  used  with 
/  =  4  hz  and  7  =  3.  The  ray  diagram  is  shown  in  figure  (11a).  A  reduced 
time  wavefield  is  shown  in  figure  (l  lb).  In  order  to  use  the  same  format  as 
the. Gaussian  beam  decomposition,  each  ray  number  is  simply  counted 
from  l  to  18  from  shallow  to  steeply  dipping  at  the  sorce  as  in  figure  (l 2a). 

A  station  distance  of  120  km  is  used.  Equation  (9a)  is  then  applied  to 
wavefield  in  figure  (lib)  resulting  in  a  S(r,p)  wavefield.  This  plane  wave 
decomposition  is  shown  in  figure  ( 1 2b)  in  the  ray  number  format  for  a 
station  at  120  km  The  slight  phase  shift  shown  on  the  first  several  traces 
may  be  the  result  of  using  a  finite  x  aperture  in  the  slant  stack.  Filtering 
this  plot  with  ray  number  corresponds  to  a  beam  forming  operation  in 
which  energy  from  certain  directions  is  enhanced  with  respect  to  other 
directions. 

Figure  (12c)  shows  the  ray  number-phase  time  plot  for  the  Gausshn 
beam  synthesis  using  a  large  initial  beam  width  of  Lq  -  128  km l/~  .  For 
this  initial  beam  width,  some  amplitude  filtering  is  still  seen  for  the  first 
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LAYER  OVER  GRADIENT 


Figure  1!  a)  ray  diagram,  and  b)  reduced  Lime-distance  wavefleld  for  a 
layer  over  a  gradient  model 
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LAYER  OVER  GRADIENT 


2tt 


Figure  12  a)  ray  digram  for  ray  number-phase  time  plots  for  a  layer  over  a 
gradient  model  with  a  station  at  120  km.  b)  slant  stack  of  wavefleld  in 
figure  (lib),  c)  Gaussian  beam  decomposition  for  L0  =  120  /bn 1/2  . 
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several  ray  number  traces.  The  resulting  scaled  sum  for  a  range  of  120  km 
is  shown  on  the  right  of  figure  (12c)  corresponding  to  the  synthesis 
operation  in  eqn  (10).  Comparing  figures  (12b)  and  (12c)  shows  that  the 
locus  of  maximum  amplitudes  map  out  the  same  curve  for  the  slant  stack 
and  the  Gaussian  beam  decomposition.  It  is  interesting  that  the  individual 
wavelets  in  figures  (12b)  and  (12c)  have  similar  phase  shifts.  This  results 
from  the  approximations  in  using  the  Gabor  wavelet  in  the  Gaussian  beam 
method,  in  particular  for  the  Hilbert  transform. 

Since  the  slant  stacking  operation  is  a  linear  beam  forming  operation, 
an  important  extension  of  this  is  to  investigate  the  variations  of  the  ray 
parameter-phase  time  wavefields  in  laterally  varying  media.  This  has  been 
investigated  for  several  structures  by  Treitel  et.  al  [1982].  Because  if  its 
usefulness  in  vertically  varying  media,  plane  wave  or  beam  decompositions 
are  anticipated  to  be  important  data  representations  in  more  complicated 
media.  This  representation  is  straightforward  to  generate  as  a  by  product 
of  many  seismogram  synthesis  routines,  in  particular,  as  ray  Entered 
beam  responses  in  the  Gaussian  beam  method. 


2.3.9  Q1  CAUSTICS 

Previous  results  have  indicated  that  in  decomposing  a  point  source 
into  beams  that  stable  results  occur  for  large  initial  beam  widths.  Nowack 
and  Aki  [1984]  suggested  that  the  critical  value  of  Lq  be  used  to  separate 
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large  from  small  initial  beam  widths,  where  for  S0  =  0  ,  4r  =  1 9z/  q  1 ! 1/2  • 
When  initial  beam  widths  smaller  than  critical  are  used,  then  phase 
interference  and  amplitude  decay  only  weakly  restrict  the  number  of 
beams  contributing  at  a  given  station.  A  finite  aperture  will  then  result  in 
a  reduced  amplitude  of  the  resulting  seismogram. 

A  g  i  caustic  is  defined  where  gj  =0  in  the  wavefleld.  For  a  station 
at  a  gj  caustic,  then  Thus  the  initial  beam  width  at  the  source 

resulting  in  the  narrowest  beam  width  at  the  receiver  approaches  infinity, 
i.e.  a  plane  wave.  However,  in  this  case  using  a  slightly  smaller  initial  beam 
width  should  still  give  a  satisfactory  amplitude,  since  the  aperture  of  rays 
will  still  be  large  compared  to  the  focused  energy  at  the  qx  caustic. 
Typically  an  average  value  of  Lq  is  used  for  the  suite  of  rays  in  the  vicinity 
of  the  station.  This  would  then  smooth  over  the  beam  sc’utions  near  the 
station.  Thus  at  a  gj  caustic,  the  finite  aperture  should  not  pose  a 
problem  for  a  suitably  large  but  finite  Lq  .  However,  a  finite  L0  will 
average  over  nearby  rays  and  lose  some  spatial  resolution. 

In  surface  wave  ray  tracing,  Yomogida  and  Aki  [1985]  found  a  g, 
caustic  along  the  equator  for  a  high  latitude  source  when  using  a  Mercator 
mapping  from  spherical  to  cartesian  coordinates.  They  found  that  as  long 
as  L0  was  large  enough  to  avoid  truncation  errors  from  the  aperture,  then 
the  amplitude  was  accurate  to  1%.  But,  an  increasing  amount  of  averaging 
of  surrounding  beams,  without  the  aide  of  phase  interference,  is  seen  in 
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figure  (4)  of  Yomogida  and  Aki  [1985].  A  slight  increase  in  the  beam  width 
at  the  qx  caustic  to  be  several  times  the  ray  spacing  should  stabilize  the 
solution.  Another  possible  way  of  avoiding  the  problems  associated  with  a 

I 

qi  caustic  is  to  use  the  real  part  of  e  ,  S0  ,  to  shift  the  beam  waist  along  ' 

. 

the  ray.  But,  this  will  also  introduce  an  additional  free  parameter. 


I 

2.3. 10  GAUSSIAN  BL  A&S  INCIDENT  ON  A  PLANAR  INTERFACE  j 

i 


In  this  section,  the  asymptotic  analysis  of  a  beam  reflected  on  a  planar 
interface  is  given,  following  the  development  of  Ra  et.  al.  [1973].  The 
saddle  point  contribution  to  the  reflected  field  is  given  by 


u(is,zSicj)  * 


/?(9)  ikiDr  -  in/  4 


dO 


where  kxDr  »  l  .  Dr  is  the  distance  from  source  to  receiver,  and  /?(0)  is 
a  plane  wave  reflection  coefficient.  As  shown  in  a  previous  section,  the 
reflected  beam  contribution  can  be  gotten  by  using  a  complex  source 
point.  Using  the  image  source  point  as  in  figure  (13),  then 

x  -*  rftsina 
z  -*  20  —  ib  cos  a 

zs  —  z 

and  the  angle  0  is  defined  from  sin0  =  — - - .  When  6  -»  3  ,  then 

U r 

Dr  -*  pr  and  0  -»  i?r  in  figure  (13). 
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Figure  13  geometry  for  beam  incident  on  a  plane  interface.  The  angie,  a  , 
corresponds  to  the  reflection  of  the  beam  axis.  The  angle,  i)r  ,  is  the 
geometric  relection  from  the  source  to  the  station.  tfe  is  the  critical 
angle. 


For  t3  >  t3c  =  sin"1^  \/v2)  ,  then  in  addition  to  the  reflected  field  there 
will  also  be  a  diffracted  head  wave  field.  In  the  vicinity  of  t?c  ,  a  uniform 
approximation  must  be  used.  Going  to  a  Gaussian  beam  by  using  a 
complex  source  point  requires  using  the  appropriate  complex  angles  in 
these  formulas.  Here  only  the  reflected  field  will  be  investigated.  In  the 
Gaussian  beam  method  as  currently  implemented,  the  rea.  reflection 
coefficient  for  the  beam  axis  is  used,  and  all  ray  tracing  is  done  along  the 
beam  axis.  Thus  in  the  reflection  coefficient,  0  is-replaced  by  a  . 

The  reflection  coefficient  in  the  acoustic  case  has  a  slightly  different 
form  than  that  for  the  SH  case.  The  reflection  coefficient  in  the  acoustic 
case  is  given  by 

For  the  SH  case,  the  reflection  coefficient  is  given  by 

RshW  = 

This  difference  in  reflection  coefficient  is  related  to  the  different  type  of 
impedance  in  the  acoustic  and  the  SH  cases  (see  for  example  AKi  and 
Richards  [1980],  Box  5  4,  or  Aki  [1970],  lecture  notes  pp.  60-61) 


vi  ■> 

p2t2c osi?  1 - msin-w 


2 

p2v2cost?  ■+*  Pit' i  l  -  — |-sin2,i3 


It  is  assumed  that  the  beam  is  well  defined  with  k{b  »  l  .  In  addition, 
the  paraxial  region  is  considered  with  »  a  .  The  reflection  coefficient  is 
then  approximated  as 


R(®)*»R(a)  +  fl'(tf)  !*  »a  Atf 
R{Q)  «  R(a) 


R(D) 
R(Q)  **R(a)  ev 


where  u  is  a  complex  phase  correction  which  has  the  effect  of  relocating 
the  maximum  of  the  reflected  field  (see  Ra  et.  al.  [1973],  Felsen  [l984j. 


In  the  acoustic  case,  then 

R'W  !  4  =  a 
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where  Xa  ~ 
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For  pj  =  p2  ,  then  \a  - — 


the  acoustic  approximation.  In  a  similar  manner,  for  the  SH  case 


R(a) 


2  sina  xsh 


vz  ■  2 
1 - sin2a 


T7T 


where  xsh  = 


Pi  ^ 

Pzvz 

[4->| 

^1 

(22  2 
p  \V  1  0 

cos -a  +  -4- 
p|u|  vf 

sin2a  -  1 

The  complex  beam  shift  is 
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For  a  real  source  point 


Ai?r  =  sin"1 


n 

Pr 


sa 


n 


For  a  complex  source  point,  Ai?r  -*  Atfr  and  pT  -»  Dy.  »  (pr  -  ib )  ,  then 

Pr 


Ai?r  »  — 


n 


—  sinAi?r 


pr  -ib  pr  -ib 

The  formula  for  the  complex  beam  shift  correction,  u  ,  is  then  given  by 


u  =  2  s  in  a  x 


pr  +  ib 


1  - 


v§  ■  2 

—5-  sura 


T7T 


prsinAtfr 


Pr*  +  *4 


As  6  -*  0  ,  this  is  a  real  perturbation.  For  6  -*« ,  then  the  beam 
approaches  a  plane  wave  and  v  goes  to  0.  .  The  Gaussian  beam  result  is 
modified  by  a  first  order  coriection  to  the  beam  axis  reflection  coefficient 
at  an  interface  by  a  factor.  eu  . 

In  order  to  see  how  the  factor  ew  shifts  the  maximum  of  the  reflected 

:W"B 

beam  amplitude,  the  maximum  of  the  factor  e  is  found  where 


rZ-g b_ 
L  ~  ic 
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and  ,fl  =  asiosLi 
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This  is  analogous  to  the  formulas  given  in  Ra  et.  al.  [1973].  Note  that  the 
beam  shift  goes  to  zero  for  k  -*»  .  This  is  sometimes  used  to  justify 
neglecting  this  factor  in  high  frequency  beam  propagation. 

As  a  simple  .example,  consider  a  surface  source  in  a  35  km  thick  crust 
with  Pi  =  3.5  km/ sec  and  px  =  2.74  g/cm3  and  0Z  =  4.6  km/ sec  and 
p2  =3.3  g/cm3  .  The  critical  angle.  .  is  49.54  degrees  and  a  critical 
distance  of  02.  km  Let  angle,  a  of  the  beam  axis  be  20.  degrees  with  a 
range  of  25.47  km  The  angle.  ,  corresponding  to  a  range  of  40  km  is 
29.74  degrees.  pr  is  80.62  km.  The  critical  value  of 
6  =L$  =  |g2/gr|  -  pr  is  used.  For  /  =  5  hz  .  n ^  =  -  09  km  ,  or  about 
an  eighth  of  a  wavelength.  For  larger  6  .  this  value  would  be 
correspondingly  less  in  the  pre-critical  range. 
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2.3.11  FINITE  FREQUENCY  EFFECTS  OF  AN  INDENTATION  ON  AN 
INTERFACE 

In  this  final  section,  the  scattering  effects  due  to  an  indentation  on  an 
interface  is  investigated.  This  was  first  studied  by  Larner  [t970]  for  a 
dented  Moho  under  the  Montana  large  aperture  seismic  array  (LASA). 
Larner  [1970]  used  the  discrete  wavenumber  technique  to  compute  the 
amplitude  and  phase  perturbations  of  a  single  frequency  signal  impinging 
at  different  angles  on  various  interfaces  indentations.  He  found  a 
significant  frequency  variation  in  the  scattered  signal.  Bard  and  Bouchon 
[1980]  extended  this  to  the  analysis  of  transient  signals. 

The  model  geometry  is  shown  in  figure  (14a).  W  and  h  are  the  width 
and  height  of  the  indentation.  The  observation  plane  is  25  km  above  the 
interface.  The  upper  half  space  has  ^  =  3.0  km/ sec  and 
Pi  =  2.8  km/ sec  ,  and  the  lower  half  space  has  u2  =  4.0  km/ sec  and 
p2  =  3.3  g/ cm3  .  An  SH  wave  is  incident  from  below  with  an  angle  of  32 
degrees  from  the  vertical.  A  Gabor  wavelet  is  assumed  with  y  -  4  and 
several  central  frequencies.  The  peak  amplitude  of  the  time  domain  trace 
is  then  picked  along  the  observation  plane. 

The  results  using  the  Gaussian  beam  method  are  compared  with  the 
time  domain  implementation  of  the  Aki-Larner  discrete  wavenumber 
technique  (see  Bard  and  Bouchon  [I9r0]).  The  peak  amplitudes  alorg  the 
observation  plane  are  shown  in  ngure  (14).  Figures  (L4b-e)  show  the 
amplitude  patterns  for  an  increasing  indentation  height  /i  fromO  km  to  5 
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INDENTATION  ON  AN  INTERFACE 


Figure  14  Amplitude  patterns  lor  a  plane  wave  incident  on  an  indented 
interface,  a)  model  geometry,  b-e)  A  =  20  km  incident  wavelet  for 
indentation  height,  h  =  0,  l,  2.5,  5.0  km  .  f-g)  A  =  10  km  incident 
wavelet  for  indentation  height,  h.  =  2.5,  5.0  km  . 


km  for  a  pulse  with  a  20  km  incident  wavlength.  The  solid  line  is  the 
amplitude  pattern  computed  using  the  Gaussian  beam  method  and  the 
dotted  line  is  the  amplitude  computed  using  the  Aki-Larner  discrete 
wavenumber  method.  The  center  of  the  indentation  is  located  at  70  km 
As  the  indentation  height  increases,  the  amplitude  pattern  increases  with 
an  asymmetrical  appearance.  The  Gaussian  beam  solution  consistently 
shows  larger  side  lobes  than  the  discrete  wavenumber  solution.  The 
difference  may  be  due  to  the  fact  that  the  Gaussian  beam  method  is  a  hign 
frequency  method  and  the  Aki-Larner  method  is  a  low  frequency  method. 
The  differences  in  the  solutions  is  probably  an  artifact  of  using  the 
Gaussian  beam  method  for  the  low  frequencies  of  this  example.  However, 
the  width  of  the  main  peak  of  the  amplitude  pattern  is  comparable  for  the 
two  methods.  In  figures  (14f-g),  the  amplitude  anomaly  for  the  two 
methods  is  shown  for  indentation  heights  of  2.5  and  5.0  km  with  an 
incident  wavelength  of  10  km.  The  main  amplitude  peak  is  narrower  for 
this  case  than  for  the  20  km  wavelength  case  and  is  also  higher  in 
amplitude.  There  is  thus  a  frequency  dependence  of  the  scattering 
anomaly.  This  was  also  observed  by  Larner  [1970].  A  simple  use  of  ray 
theory  for  an  example  such  as  this  would  never  reveal  the  frequency 
dependent  nature  of  the  scattering  The  high  but  still  finite  frequency- 
methods  like  the  Gaussian  beam  method  can  be  used  to  bring  out  some  of 
these  finite  frequency  effects. 


2.3.12  CONCLUSIONS 


Several  topics  in  Gaussian  beam  synthesis  have  been  investigated  in 
this  paper  including  the  choice  of  the  beam  parameter,  e  ,  and  its  relation 
to  a  complex  source  point.  The  asymptotic  and  exact  evaluations  to  the 
Gaussian  beam  integral  shows  an  additional  singularity  which  goes  to 
infinity  as  c  -*  »  .  But,  this  typically  won't  be  computed  in  the  usual 
Gaussian  beam  synthesis.  An  initial  point  source  in  a  vertically  varying 
gradient  distinquishes  between  an  initial  Snell  wave,  which  is  singular  for  a 
horizontal  ray,  and  an  initial  plane  wave.  In  a  homogeneous  source  region 
these  are  the  same,  but  in  an  inhomogeneous  source  region  each  has 
different  advantages.  A  first  order  correction  to  the  plane  wave  reflection 
coefficients  are  then  described.  This  results  in  a  beam  shift  which  goes  to 
zero  for  high  frequency  beam  propagation.  A  simple  example  of  a  3b  km 
layer  shows  a  modest  beam  shift  for  a  5  hz  signal.  Finally,  the  finite 
frequency  scattering  effects  for  an  indented  interface  are  computed  using 
the  Gaussian  beam  method  and  shows  reasonable  agreement  with  the 
effects  computed  using  the  Aki-Larner  method. 

Further  work  using  the  Gaussian  beam  method  include  investigation  of 
the  real  part  of  the  complex  beam  parameter,  e  ,  which  can  be  used  to 
shift  the  beam  waist  presumably  to  where  the  strong  heterogeneity  is 
located.  For  small,  continuously  distributed  heterogeneities,  a  multiple 
beam  Kirchhoff  scheme  would  be  one  possible  approach.  This  would  involve 


decomposing  the  wavefleld  at  regular  intervals.  Finally,  the  development  of 
a  practical  implementation  of  the  full  3-D  Gaussian  beam  formalism  will  be 
important  in  synthetic  seismogram  modelling. 
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2.4  ITERATIVE  INVERSION  FOR  VELOCITY  USING  WAVE^RM  DATA 
2.4.1  INTRODUCTION 

In  this  paper,  an  iterative  inversion  method  for  earth  structure  is 
investigated.  Previous  inversions  for  velocity  using  array  data  have  mostly 
used  travel  times  in  constructing  the  tomography  problem  (see  Aki  [1977], 

Aki  et.al.  [1977]).  Recently  iterative  methods,  such  as  the  algebraic 
reconstruction  method,  have  been  applied  to  travel  time  inversions  (see 
McMechan  [1903],  Clayton  and  Comer  [1983],  Humphreys  et.al.  [1904]). 
Several  of  these  iterative  algorithms  are  described  by  Censor  [1901]  The 
iterative  methods  have  advantages  over  generalized  inversion  methods  . 
when  dealing  with  large  sparse  matrices,  but  can  have  convergence 
problems  when  small  eigenvalues  are  present  (Ivansson  [1983]).  Still,  there 
is  much  more  information  is  seismic  data  than  ;ust  first  arrival  travel 
times,  including  amplitudes  and  waveforms.  Figure  l  shows  a  sketch  of  a 
heterogeneous  region  with  surrounding  sources  and  receivers.  A  simple 
transmitted  waveform  is  shown. 

A  number  of  studies  have  incorporated  amplitude  and  waveform  data 
in  seismic  inversion  and  interpretation.  For  example,  spatial  variations  of 
phase  time  and  amplitude  from  teleseimic  body  waves  recorded  at  the 
Montana  LASA  were  interpreted  by  Larner  [1970]  in  terms  of  a  dented 
Moho.  Haddon  and  Husebye  [  1 978]  and  Thomson  and  Gubbins  [  1982]  did 
joint  interpretations  of  travel  time  and  amplitude  to  infer  the  structure 
beneath  NORSAR.  Waveform  inversion  using  refraction  data  assuming  a 
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vertically  varying  structure  has  been  done  by  Shaw  [1983]  Brown  [1984] 
did  a  comparison  of  travel  time  inversions  and  waveform  inversions  using 
synthetic  data  also  in  a  vertically  varying  structure.  Migration  of 
reflection  seismic  data  is  a  form  of  inversion  (see  Schneider  [1978]),  and  it 
has  been  recently  shown  to  be  kinematically  similar  to  a  linearized  Born 
inversion  (see  Tarantola  [1984b],  Miller  et.al.  [1984]).  Direct  inversions 
based  on  the  linearized  model  have  been  investigated  by  Cohen  and 
Bleistein  [1979j  and  Raz  L 198 1 J.  There  has  also  been  some  progress  in 
generalizing  exact  l-D  inversion  methods  to  higher  dimensions  (see  Newton 
[1983]). 

A  recent  formulation  by  Tarantola  [  1984a]  uses  an  iterative  linearized 

approach  and  is  decribed  in  the  next  section.  A  linear  sensitivity  operator 

must  be  derived,  and  this  can  be  done  economically  by  using  reciprocity  of 

the  Green's  function.  Several  descent  algorithms  are  described  which 

avoid  a  large  matrix  inversion.  A  fast  and  reasonably  accurate  forward 

modelling  scheme  is  required,  and  here  we  make  use  of  the  Gaussian  beam 

method  in  a  slowly  varying  mecium.  Data  errors  and  a  priori  information 

_  r 

are  incorporated  using  covariance  operators.  Diferent  types  of 
linearizations  are  possible  including  a  linearization  in  terms  of  the  field, 
the  Born  approximation,  and  a  linearization  in  terms  of  the  log  of  the  field, 
the  Rytov  approximation.  Their  relative  merits  are  discussed.  Finally 
several  numerical  examples  are  performed  using  a  linearization  in  terms  of 
the  field  and  transmitted  waveforms  in  order  to  test  the  method. 


2.4.2  LINEARIZED  INVERSION 


In  this  section  a  review  of  the  linearized  inverse  approach  is  given 
following  Madden  [unpublished  lecture  notes],  Tarantola  [1984a,  1904b, 
1904c],  and  Lailiy  [1903].  In  the  linearized  approach,  flrst  a  linear 
sensitivity  or  Frechet  derivative  operator  must  be  derived  and  then  used  in 
an  iterative  procedure  for  the  model  parameters.  In  addition,  several 
descent  algorithms  are  described  which  don’t  require  the  inversion  of  a 
large  matrix. 


where  gif'.t  f)  is  the  Green's  function  from  f  to  f  .  and  *  :s  a  time 
convolution.  This  can  be  derived  from  the  bilinear  identity  [see  Lanczcs. 
1961].  For  the  scalar  wave  equation  the  Kirchhoff  integral  is  the  boundary 
term  In  the  following,  homogeneous  boundary  conditions  will  be  assumed 


giving  zero  for  the  boundary  term.  Finally,  for  a  self  adjoint  operator  L 
with  homogeneous  boundary  conditions,  the  Green's  function  is  reciprocal 
with  respect  to  source  and  receiver  location.  Thus 

In  order  to  obtain  a  linear  sensitivity  operator,  a  perturbed  problem  is 
cc  .istructed 

|  L  +6L  +  5p  =  5  +c5 5  (3) 

or 

L{6p)  -  -6L  | p  +  dp  +  <55 

Assuming  that  6L  6p  is  small  is  equivalent  to  the  Born  approximation 
which  requires  that  the  perturbed  held  5p  be  much  smaller  than  the 
unperturbed  primary  held  p  .  To  first  order,  (3)  can  be  written 

L{fip)  =  -*l[p  +<5  S  (4) 

where  -6 L{p)  is  an  equivalent  source  term  for  the  medium  perturbations 
and  5S  is  the  term  for  the  source  perturbation:*  Here  we  -will  Investigate 
the  "inverse  medium  problem"  for  a  given  source,  thus  6S  =  0  .  The 
solution  for  the  perturbed  held  can  then  be  written  as  a  space  integration 
over  secondary  equivalent  sources 

6p  {fg.t  /,)  =  -f  d.V{?)  g(?,tfg)  *6L  [pif.tf,]  ]  (5) 

where  6L  includes  the  model  perturbations,  and  p  is  the  incident 
primary  held.  Equation  (5)  can  be  written  as 


'6) 


6p(Fg,tf l)  =  | =  F  (5l, 

where  F  is  a  linear  sensitivity  or  Frechet  derivative  operator,  and  the 
operator  6L  has  imbedded  within  it  the  model  variations.  Assuming  that 
the  primary  field  can  be  written  as  p{?,t,?s)  =  .  then 

=  -f  dV(?)  *61  [3  (?.*;?,) 

Thus  the  linear  sensitivity  operator  can  be  constructed  by  computing 
g(f.t:fs)  which  propagates  the  field  from  the  source  to  each  modelr 

point  f  ,  and  g{?,t  ?g)  which  backpropagates  the  field  back  from  the 
geophones  into  the  model  at  f  This  requires  the  computation  of 
Ng  +  Ns  forward  problems  evaluated  at  the  interior  points  of  the  model 
where  Ng  is  the  number  of  geophones  and  *V,  is  the  number  of  sources 

For  a  given  L  ,  the  perturbed  operator.  6L  .  can  be  separated  into 
operations  on  the  forward  field,  p(f,t,fs )  ,  and  perturbations  of  the  model 
parameters.  For  example  for  the  scalar  wave  equation 

i^V) 

where  the  term  in  the  brackets  is  the  linear  sensitivity  operator.  F  -  . 

av 

6v(7t)  is  the  velocity  model  perturbation  at  f,  and  p{f,tf, ,)  is  the 
second  time  derivative  of  the  primary  field  computed  from  the  source  point 
to  the  point  ?  in  the  medium  The  linear  sensitivity  operators  for  the 
elastic  wave  equation  with  respect  to  the  field  are  given  by  Tarantola 
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1  [1984c].  The  linear  sensitivity  operators  for  the  acoustic  wave  equation 

with  respect  to  the  field  are  given  by  Tarantola  [  1984a],  and  used  in  a 

. 

simple  1-D  example  in  Appendix  3. 

In  the  following,  the  linearized  problem  (5)  will  be  be  imbedded  within 
an  iterative  procedure.  The  effect  of  ignoring  the  6L  5p  term  in  the 
linearization  will  result  in  no  multiple  interactions  with  the  perturbations 
within  6L  ,  or  no  "cross  talk"  between  perturbations  during  each  iteration. 

Multiple  interactions  from  previous  iterations  are  included  via  the  Green's 
functions.  In  principle,  an  iterative  strategy  should  be  able  handle  large 
velocity  contrasts.  This  approach  requires  the  calculation  of  Green's 
functions  in  inhomogeneous  media  inexpensively  *  and  reasonably 
accurately.  Various  forward  modelling  approaches  could  be  considered 
including  ray  theoretical  methods,  the  discrete  wavenumber  method,  or 
the  finite  difference  or  finite  element  methods.  Any  inversion  method  is 
only  as  good  as  the  forward  modelling  scheme  on  which  it  is  based. 


The  forward  problem  as  a  function  of  model  parameters  is  in  general 
nonlinear  function  and  can  be  written 


P  =  f(m) 


(8) 


where  p  is  the  field  and  rfl  is  the  model  parameters.  In  general, 
Pota  */  (rftprwr'  ■  where  pobs  is  the  observed  field  and  'fipnor  is  the  a 
priori  model  based  on  previous  information.  The  objective  is  Lb  find  the 


combined  vector  p,  rfl 


that  satisfies  p  =  /(rfl )  and  is  closest  to 


1%rior  |  in  some  sense.  This  distance  could  be  defined  in  various 

ways  depending  on  the  noise  structure  of  the  problem,  including  the  Lz 
norm,  the  Lx  norm,  or  the  Lm  norm.  Here  the  weighted  Lz  norm  will  be 
considered  with  it's  associated  induced  inner  product.  This  results  in  a 
least  squares  formulation.  Thus  we  want  to  minimize  the  functional  [see 
Tarantola  and  Valette.  1982,  Tarantola,  1984a] 

=  i-f  'Pots  “/Wi2  +  -Tilpnor  !2  (9) 

where  the  1/2  has  been  introduced  for  later  convenience.  The  linear 
sensitivity  operator,  Fk  at  rfik  can  then  be  defined  as 

/(rflfc  +  6ifl)  =/(t%)  +  Fk67fik  +  0()  6iAk \z) 

The  negative  of  the  gradient  of  S(rftk)  gives  the  local  direction  of 
maximum  descent  at  rhk  ,  To  first  order  this  can  be  written 

9k  =  “V  5(m*)  =  Fk6p  -  (rn*  -  m^or)  (10) 

where  6pk  =  Pot?  -  /(^k)  .  and  Fk  is  the  adjoint  of  the  iincar  sensitivity 
operator,  Fk  The  adjoint  is  defined  from  the  bilinear  identity 

<6p,F6rh>ip  -  <F'6p,rh>arn  =  boundary  term  (II) 

where  <  ,  >  is  a  defined  inner  product.  The  boundary  term  is  assumed  to 
be  zero  using,  homogeneous  boundary  conditions.  Thus.  F  is  a  linear 
mapping  from  model  to  data  space,  and  the  adjoint,  F* ,  is  a  linear 
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mapping  from  data  space  back  to  model  space.  Inverse  operators  are  in 
general  difficult  to  construct,  but  adjoint  operators  through  the  bilinear 
identity  are  straightforward  to  obtain.  In  addition,  adjoint  operators  have 
very  useful  properties  which  aide  in  the  construction  of  inverse  and 
generalized  inverse  operators. 

Consider  a  linear  problem.  Frh  =  poba  ,  with  noisy  observations  but 
where  the  nullspace  of  F  is  zero.  N{F)  -  5 .  The  nullspace.  N(F)  . 
represents  the  subspace  of  the  model  space  that  satisfies  Frh  -  5  .  The 
standard  least  squares  solution  results  by  using  the  adjoint  theorem. 
r\f)  =  N{F*)  .  where  R  is  the  range  of  Frh  in  data  space.  Thus 
mapping  Frh  and  pobs  through  F *  annihilates  the  component  of  pob3  tn 
R  {F)  resulting  in  a  consistent  set  of  equations.  Then.  F'  Frh  -  cr 

fh  -  {F* F)~xF*pots  For  N{F)  =0.  (F'F)  is  full  rank  and  an  inverse 

exists.  The  standard  least  squares  solution  thus  results  from  two 
operations  on  the  data.  poba  First,  the  data  is  operated  on  by  the  adjoint 
operator.  F *  .  which  projects  pobt  from  data  space  to  model  space.  This 
blurred  .mage  in  model  space  is  then  filtered  with  the  operator  (F'F)~l 
When  N(F)  *  0  ,  a  generalized  inverse  can  be  computed  by  using  additional 
adjoint  theorems.  This  ts  equivalent  to  the  Lanzcos  formulation  which 
results  in  a  minimum-norm  least-squares  solution  [Aki  and  Richards. 
1980]  An  alternative  is  to  stabilize  the  [F’F)  operator  using  a  maximum 
likiiiood  procedure  which  reduces  to  minimizing  a  functional  like  (9)  with 
appropriate  data  and  model  covariance  operators  defined.  Any  a  prion 
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knowledge  about  the  component  of  the  model  in  the  null  space  can  be 
incorporated  into  the  final  solution.  This  might  include  smoothness  of  the 
solution. 

The  maximum  liklihood  algorithm  of  Tarantola  and  Valette  [1982]  is  a 
linear  iterative  procedure  for  solving  the  nonlinear  inverse  problem,  (8),  by 
minimizing  the  functional  (9).  It  can  be  written 

=  ™-k  +  [  Hk  *  K&P  -  (mfc-mpncr)  1  (’2) 

where  the  term  in  the  brackets  is  the  negative  gradient  of  the  functional 
S(rft)  .  and 

Ht={:  +  F;Ft\  (13) 

Covariance  operators  can  be  introduced  by  a  suitable  definition  of  the 
data  and  model  space  scalar  products  [see  Taranto.a.  1984c]  Thus 

..  —  —  7V.-1  «♦ 

<771 1  ,  7712 

<Pl’Pz>p  =PT\C?XP  2 

where  fhT  is  the  transpose  of  rft  ,  Cm  is  the  model  covariance  operator, 
and  Cp  is  the  data  covariance  operator.  From  this  definition  of  the  scalar 
products,  the  adjoint  operator,  Fk  ,  of  Fk  is  related  to  the  transpose  by 

Fi  =  CmFlC-'  (14) 

The  adjoint  is  equal  to  the  transpose  when  Cm  =  /  and  Cp  -  I  For  a 

complex  operator  F.  the  transpose  is  replaced  by  the  conjugate  transpose 

operator. 


A  major  problem  with  the  iterative  scheme  (12)  is  that  in  discretized 
form  it  requires  the  inversion  of  a  possibly  large  matrix.  Since  the  goal  is 
to  iteratively  solve  the  original  problem,  (8),  only  an  approximate  solution 
of  the  the  imbedded  linearized  problem  may  be  required.  Thus  a  simple 
iterative  scheme  could  be  written 

mfc+i  =  +  <*kWk7k  (15) 

where  yk  is  the  negative  gradient  of  the  functional  S(m)  ,  Wk  is  called  a 
preccnditioner.  and  ak  is  a  scalar  that  approximately  solves  a  l-D  line 
search  of  5(rfl)  in  the  direction  Wkyk  .  ak  is  thus  chosen  to  minimize 
S(7hk  +akWkyk)  with  respect  to  ak  giving 


ak 


<  Wk%MkWkyk> 


(16) 


where  Hk  is  given  in  (13).  For  the  maximum  hklihood  algorithm,  (LI). 
Wk  =  Hkx  giving  ak  =  1  .  This  would  then  be  similar  to  a  modified 
Newton's  method  for  solving  the  nonlinear  problem.  (8),  dependent  on  the 
choice  of  the  covariance  operators  used. 


Various  simplifying  choices  for  Wk  could  be  considered.  Choosing 
Wk  -  I  results  in  the  steepest  descent  method.  Thus 

rhk+x  =7hk  +  akyk  (P) 

where 


-w".1  ■ 1  ■> ■w'v  ■  -w  — pwmnvfTv 


-  <?*■?*> 

0fc  <7k,Hc7k> 

with  /4  given  in  (L3).  For  a  quadratic  5(m)  ,  yk and  ?fc  will  be 
perpendicular.  One  thus  moves  in  the  direction  yk  until  tangent  to  a  level 
curve  of  S(m)  .  At  this  point,  yk+i  is  chosen  perpendicular  to  the  level 
curve  in  a  direction  of  maximum  descent. 


Another  possible  choice  for  Wk  is  [see  Tarantola,  1984a] 

wk  =  [flHC  (fi)  j*1 


(18) 


This  is  similar  to  a  modified  Jacobi  method  with  suitably  defined  covariance 
operators. 


in  any  of  the  preconditioning  strategies,  the  single  step  convergence 
properties  for  quadratic  functionals.  S(m)  ,  are  governed  by  the 
difference  in  the  smallest  and  largest  eigenvalues  of  WkHk  with  Hk  given 
in  (13).  From  Luenberger(  1984), 


5{mk  +l)  s 


^"nax 


S(rhk) 


Thus  the  closer  fVk  is  to  Hkx  ,  the  better  the  single  step  convergence  A 
general  overall  strategy  is  to  construct  a  preconditioning  scheme  that  is 
easy  to  compute  and  possesses  a  favorable  eigenvalue  structure  at  each 


171 


Instead  of  using  the  gradient  directions  in  the  steepest  descent 
approach,  improved  convergence  for  very  Little  extra  effort  can  be  gotten 
by  using  so  called  conjugate  gradient  directions  defined  with  respect  to  a 
new  inner  product.  <x,x>  =  xr/&x  where  £&  is  given  in  (13).  The 
Fletcher-Reeves  implementation  of  the  conjugate  gradient  method  to 
nonquadratic  problems  is  briefly  outlined.  First,  given  r\r and 
%  =  •  set  20=%.  Sow  let  =  tt^  +  ak3.k  where  ak 

minimizes  S{ff>.k  +  ak3.k)  The  next  direction  of  descent,  is  chosen 

to  be  Hk  -orthogonal  to  the  previous  direction  <2*  .  This  is  done  by  setting 
+  Pkdk  •  where  0*  =  <%+i.7k+\>/  <%.7k>  •  4:+i  is  thus  a 

direction  modified  from  the  steepest  descent  direction.  In  a  similar 
fashion  to  steepest  descent,  3-k+\  and  £k  are  orthogonal  but  with  respect 
to  Hk  .  The  conjugate  gradient  method  has  the  important  property  that 
the  estimate  r\k  minimizes  a  quadratic  functional  5(m)  over  a  subspace 
spanned  by  all  the  previous  directions  (c?o.  '  '  •<?*- 1)  [see  Luenberger. 
1984; 


411  of  the  above  methods  require  the  the  linear  sensitivity  operator  Fk 
and  the  adjoint  of  this  operator  Fk  For  example  for  the  scalar  wave 
equation  (2) 


— Plr 


.f.r,)  y{r  ,t  \r g)  = 


where  velocity  v  is  the  model  parameter  Thus  to  first  order  6p 
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Using  the  bilinear  identity,  (11).  with  Cm  -  I  and  Cp  =  /  and 
homogeneous  boundary  conditions,  then  F *  -  FT  and 

<6pS6v>p  =  dt  6pF(6v) 

r.  r, 

=  SS/ fdV(T ') 

r»  Tt 

Then, 


u3(r’) 


'(T‘,t.ra)  0g(r', 


\t:rg)6v(r) 


<F*6p  ,6v>u  =  f  dV(r')F'(6p)6v 
fdV(r‘)  *  9{r‘.t\rg)^p{rg.t\Ts)6v(r  ) 


Thus  Ft  is  just  Fh  with  summations  now  over  geophone  and  source 
locations  and  an  integration  in  time. 


The  final  adjoint  operator  includes  the  data  and  model  covariance 
operators.  Assuming  uncorrelated  errors  over  source  and  receiver 
locations  and  time,  then 


where  crgs  represents  the  estimated  error  in  the  trace  corresponding  to 
the  gt>v  receiver  and  the  s 01  source.  For  the  model  covariance  operator, 
a  spatial  Gaussian  correlation  is  a  commonly  used  choice  (see  Aki  and 
Richards.  1980;  Tarantola.  1984a].  Thus 


m 


where  v  is  the  model  parameter  rh  ,  L  is  the  medium  correlation  length 
for  the  Gaussian  correlation  function,  and  av  represents  the  estimated 
departures  of  from  rhpnar{f)  .  As  L  goes  to  zero,  then 

Cyi? ?')  =  -?')  ■  The  model  covariance  operator  will  contrain  the 

iterated  model  parameters  as  well  as  act  like  a  spatial  smoothing  filter. 
The  complete  adjoint  can  be  written,  =  CyFfCf1  ,  as  in  (14). 

For  the  scalar  wave  equation,  the  preconditioned  descent  method  can 
be  written 

**♦1  =  +  a.  Hi,  (  C,FT6f  -  (ff*  -  tT^)  ]  (22) 

where 

6p'  =  Cf[6$ 

are  the  weighted  data  residuals.  Now  let 

<H7  =  C^Fl 6?‘  (23) 

517  =  Cv  Y^fdt  )  *  9  if  £  Tg)  tfg.t'f*)  (24) 

rt  r,  V3(f) 

This  can  be  rewritten  as 


W(?)  =  CvZYfdt  *6p'{fg>tfs)  (25) 

r»  r3  V  (*/ 


where  .the  term  tn  the  brackets  corresponds  to  the  propagation  of  the 


weighted  residuals  from  the  geophones  into  the  model  backwards  in  time. 
This  is  then  compared  with  the  computed  primary  field  from  the  sources  to 
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the  model  point.  A  summation  over  all  source  and  receiver  locations  is 
then  done  and  an  integration  over  time.  Finally,  the  a  priori  model 
covariance  operator  is  applied.  This  is  reminiscent  of  an  imaging  principle 
in  which  the  reflector  (velocity  perturbation)  exists  where  the  downgoing 
and  upgoing  waves  coincide  in  time  (Claerbout,  1976,  Tarantola,  1984b). 

In  order  to  implement  this  algorithm.  p(r,t:?t)  and  g(?,t\fg)  must 
be  computed  where  f  is  a  point  in  the  model  (  see  Figure  2  ).  Since  for  an 
inhomogeneous  medium  an  inexpensive  forward  modelling  scheme  is 
required,  one  possiblity  is  that  paraxial  ray  theory  or  the  Gaussian  beam 
method  be  used  to  calculate  p  and  g  (see  Cerveny  et  al.,  1982,  Nowaek 
and  Aki,  1984).  An  advantage  of  using  these  methods  is  that  no  two-point 
ray  tracing  is  required.  In  addition,  the  Gaussian  beam  method  produces  a 
smoothed  field  with  no  unphysical  singularities  in  amplitude  resulting  from 
caustics  which  may  have  adverse  effects  on  an  inversion. 

Using  the  ray  approximation  in  3-D. 

g(f.tTgI^Arrt6(t  -  Trrt) 

f ,)  *  Arr,S(t  -  Try.') 

where  ArTl  and  7^  are  the  amplitude  and  travel  time  computed  by  ray 

theoretical  methods  and  5(f)  is  the  second  derivative  of  the  source  time 
pulse.  For  the  2-D  case,  an  additional  n/  4  phase  shift  m  the  far  field 
must  be  included.  Equation  (24)  is  then 


rev*  wvwwuvw,/wvw’ m.Tsnrevwvwww v*w  w  wwk  n  uvuwi  w  -  :wvkvxvkvw  rewvi  -  \ 
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Figure  1 


f  igure  l  Sketch  of  hetei  -aeneous  body  with  surrounding  sources  and 
receivers.  A  simple  transmitted  waveform  is  shown 
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points  of  the  model.  receiver  e.aluated  at  the  interior 


is  the  cross  correlation  of  the  weighted  residuals  with  the  second 
derivative  of  the  source  time  pulse.  6Gk  is  the  result  of  the  weighted  sum 
over  the  phase  surface  through  the  data  defined  by  t  =  TTTj  +  7^  for  each 

model  point  r  . 


When  the  a  priori  starting  model  is  close  enough  to  the  true  Velocity 
model  so  that  the  Born  approximation  is  strictly  valid,  then  Fk  =  Fq  for  all 
iterations.  In  addition,  if  the  a  priori  model  is  homogeneous,  then 


=  CvFW  =  ' Mr,.t  =  m  .r.)  (29) 


where  c5jo  is  defined  above,  Rrrt  is  the  distance  between  source  and  model 
point,  and  RTTt  is  the  distance  between  the  geophone  and  the  model  point. 
This  is  similar  to  a  KirchhofT  migration  and  a  cross  correlation  with  the 
source  time  pulse  [Schneider,  1978;  Tarantola,  1984b;.  Thus  assuming  the 
Born  approximaticr.  is  strictly  valid,  then  a  velocity  inversion  using  a 
descent  approach  is  equivalent  to  an  iterative  sequence  of  KirchhofT 
migrations. 


A  complete  iterative  step  involves  a  preconditioner,  ,  and  a  scalar 
ak  ,  which  solves  a  l-D  line  search.  Wk  is  assumed  to  be  some 
approximation  to  (/  +  FkFk)~l  .  Approximating  Fk  by  a  matrix,  then 


F  -  Bij 

where 

Bij  -  -5 j-rP  (rj.t  ;r, )  *g  (r  ,t;rg), 

with  l  =  IJVp  where  ,Vp  is  the  number  o(  data  values  =  Nt  *.VS  *Ng  ,  and 
7  =  1  ,NV  where  ,Vv  is  the  number  o(  velocity  model  values.  Assuming 
a2 

Fk  -  -rrFl  with  C,  =  oil  and  Cu  -  oil ,  then 

of 

w*  =  (/  +  F:F„'hi  =  L+4  2 

Op  m*  t 

Thus 


(/  +  F:Fk\j  =  6ii  + 


2S 2  d'Av 37"" p (rt =r» )  V (r> Tg ) 

r,  r,  /V  (r;-y 


In  the  ray  approximation  this  can  be  written 

(/  *  =  6xj  + 


7  n.  ^  t  .  ,  Y,H’^Tirt’^Ttr.Ar,rsAT.r3~(t  -Tr,r  +  Tr<r  ir.r.~rrjr3i 

p^rjv^r,;  r,  r,  '  '  '  '  ‘  ;  * 


where 


With  a  high  frequency  source,  this  matrix  will  be  sparse.  For  a  diagonally 
dominant  operator,  ,  then  an  improved  single  step  convergence  can  be 
gotten  using  Wk  =  {DlAG{Hk))~l  .  where 


DUC(Hk)n=  l  + 

ypv  »■*  rt 


Thus  a  simple  preconditioned  iterative  scheme  can  be  written 


/  \  _  /  \  ,  ,  ,  4<7v3(0)  A  2  « 2 

vk*ivrJ  =  Wfcvrt)  +  a*  1  +  -5-57— 

Upv  \ri)  r,  ra 


r,  Tg  V  [  T  / 

When  the  a  priori  model  is  homogeneous,  the  starting  model  is  sufficiently 
close  to  the  true  model  so  that  the  Born  approximation  is  strictly  valid, 
and  with  av  -*  «  ,  then 


VfcnW  =^t(r »)  +afc8tr2v3(rl).  -j-  ££  ■  ■  ■  4 — 

,  aw  rt  r»  Krxrg 

nrf  1  l  aP  x  _  +  ^T<r )  _  1 

r»  r»  ^r»  ^(o)  v 


where  jc^/erj  includes  the  spatial  filtering  part  of  the  Covariance 

operator.  6p  is  given  in  equation  (28)  and  is  normalized  by  o*(0)/  (Jp  .  and 
ak  is  gotten  from  equation  (l 6)  The  factor  ££ 


Lh  |llJ»*J»,.M.Ul,tJ»\l|,»i|'»H  i^l  t>fc  «<,l.t  t  J  jl  A  Mi  lit  'III  i||.M  **! 
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approximately  corrects  for  geometric  spreading  A  simple  variant  of  this 
formula  will  be  used  in  the  examples  to  follow. 


1B1 


2.4.3  ALTERNATIVE  LINEARIZATIONS 

The  formulation  tn  the  previous  section  was  applied  to  the 
linearization  of  the  field  variables  General  limitations  to  this  approach 
are  summarized  in  appendix  1  A  simple  example  of  the  limitations  in  using 
a  linearization  in  terms  of  the  field  can  be  seen  as  follows.  Consider  a  field 
in  an  unperturbed  homogeneous  medium  (see  Brown.  [  1984 j) . 


u0^r.O  = 


6(t-R/vc) 


where  R  =  r  -  r0\  [n  a  slightly  perturbed  homogeneous  model  with 

velocity  =  vq  +  6v  .  the  fleld  is 

,  dC-K/e,) 

u'(r-!  -  4-r R  . 

The  perturbed  minus  unperturbed  field  is  then  approximated.  by  the 
linearization 


U!  -  u0 


•  ..  du  , 

=  QU  -J  -  OV 

dv 


6{t-R/vx)  OU  -R/y£_  ^  dj  ~  R/v  q) 

4rr  R  4rr  R  dv  4tt  R 


which  reduces  to 


<5(^  -R/v)  -5(t  -  R/vq)  *  6 \t  -  R/v0)  —  6v 
Thus  the  actual  differential  seismogram  is  approximated  by  by  a  constant 


times  the  derivative  of  a  delta  function.  As  seen  from  Figure  3.  depending 
on  the  frequency  content  this  will  be  a  poor  approximation  when 
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r  Igure  3  A  linsanzation  in  terms  of  the  field  approximates 
seismogram  on  the  left  with  the  doublet  on  the  right 
good  approximation  only  for  low  enough  frequencies. 


tne  differentia! 
This  will  be  a 
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is  large.  For  a  single  harmonic  frequency  m  a 


«  =  U--s.) 

Vl  V0 

gUjR/V 

homogeneous  medium,  then  u  =  ■  The  linearization  in  terms  of 

the  field  can  then  be  written 


or 


giuR/v\ 

4nR 


»u/?/v9  ,  .  p)  i'jR/V0 

l - *  6V  =  « - 

4rr#  du  4tt/? 


for  small  u/d  — - —  \  «l  .  then  the  exponential  can  be  approximated  as 


gliiOt  =»  i 


(  i  i  , 

+  i y /?| - —  •  Thus  for  sufficient^  low  frequencies,  such  mat 

(vi  vaj 


'~6t  «  l  ,  then  the  'linearization  will  be  a  good  approximation  to  the  actual 


differential  field. 


An  alternative  linearization  could  be  performed  :n  terms  of  the  log  of 

ho 

the  field  variables.  This  has  been  successfully  aoDlied  electromagnetic 

A 

problems  by  Madden  (personal  communication).  By  perturbing  the  velocity 
in  a  homogeneous  medium,  we  again  consider  the  extreme  case  of 

ka  »  l  With  ln(u0)  =  In 

InK)  -  Jn(u0)  *  -5(lmo) 


iuR/vj 

- - — -  .  then 

4nR 
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l«fl  - - — 

«i  yo 


»  iuR 1 


u| 


(v0)  <5  lnv 


'J 


J _ L 

ti,  u0 


*  ^wi  -  wo) 
vo 


Thus  for  this  simple  example  of  a  homogeneous  medium  perturbation  there 
is  no  dependence  on  frequency  for  the  complex  phase  linearization.  The 
region  of  validity  for  the  log  field  parameterization  is  more  involved  in  the 
general  case  where  there  is  an  amplitude  perturbation  as  well.  In  any 
event,  this  type  of  linearization  has  been  >ae3^successfully  applied  in 
travel  time  inversions  'see  Aki  et.al.  [1977],  Clayton  and  Comer  [l983j, 
where  only  the  imaginary  part  of  the  complex  phase  is  used. 


The  linear  sensitivity  operator  in 'terms  of  the  log  field  is  now  derived 
and  compared  with  the  3orn  linearization.  The  .og  field  linearisation  is 
additive  in  the  phase,  and  is  equivalent  to  Ryiov's  method  of  smooth 
perturbations  (see  Tatarskii  [  197 1 section  45:  fshimaru  [1978]  ).  Rytov's 
approximation  is  compared  in  Appendix  2  with  other  asymptotic, 
multiplicative  approximations  including  geometric  optics  and  the  parabolic 
equation  method.  We  start  with  the  He  imho  lz  equation 


Vhi  + 


=  0 


and  let  f  -  Infuff  ,u})  then  the  following  nonlinear  Riccatti  equation  s 
obtained 


195 


*i>  *  w;2  ♦  -^r  =  o 
v(*r 

Now  a  perturbed  problem  Is  derived  with  Y'  =  V'o  +  2  Vh  and 

v  =  v0tf )  +  e  v  j(f)  ,  assuming  c  «  t  .  Y'o  *  Vi  •  and  Uq  **  t»i  Then 

7%  +  (Vt£.0)3  +  -i^-. 

(*) 


+  e 


+  2V^07^,  - 


2u2ti1(x) 


”o  (•*) 


+  r 


(W2  + 


3cj2uf(f) 

i'Q4(2) 


=  0 


The  first  bracket  is  equal  to  7ero  by  solving  the  'nrvrard  problem  in  the 
unperturbed  medium.  u0(f)  .  Dropping  the  sSnrd  bracket  in  re¬ 
constitutes  the  Rytov  approximation  which  is  now  linear  in  Yh  and  ul 
Letting  <5v(f)=eu,  and  6y  -  cYh  gives 

V2(cty)  +  2VY0V(^)  = 

This  is  a  linear  equation  for  which  can  be  simplified  by  the 

substitution.  5^=^e^3 


V2^  + 


u2  -  _  2t;2dv  _*j 
qi/  =  - t— e 


w  2  fjf^ 


which  has  a  solution 
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or 


^(Vr»)  =  -fdV(r'9(rg 


r‘}8Wr,)  Z^6v 

v$ 


WT,.r,)  =  -f  dV(r  )g (tg tr')i*°^r  ,r''l“*0^ri'r''> ^2j}L 


(35) 


This  gives  the  linear  sensitivity  operator  in  terms  of  the  log  field  variables 

a-j. 

in  the  form  <5^  =  —  6v 


To  see  how  this  compares  to  the  Bom  approximation  or  a  linearization 
in  terms  of  the  field,  equation  (35)  is  exponentiated 

e<*  =  8*  -*>  =  c-/*l'(rK  0 


For  small  —JdV(r){  ) 


ef  S3  e*s 


Jd.V{r  )p  [r'.rt)g  {rg,r'} 


2~26v(r) 


which  is  equivalent  to  the  frequency  transformed  equation  (?)  Thus 
equation  (7)  can  be  derived  from  (35)  assuming  that  the  exponentiated 
argument  is  small.  It  appears  that  the  Rytov  approximation  includes  some 
multiple  scattering  effects  not  included  in  the  Born  approximation. 
However,  the  Rytov  approximation  was  originally  derived  assuming 
«  l  .  Thus,  there  has  been  an  ongoing  controversy  as  to  whether  the 
Rytov  is  in  fact  any  better  than  the  Born  approximation  (see  Tatarskii 
[1971],  Barabanenko  et.  al.  [  1 97 1  ] .  Aki  [1973],  Brown  [1966,1967],  Taylor 
[1967],  Yura  et.  al.  [1903]).  In  fact.  Aki  and  Richards  [190C]  derive  their 


expressions  for  the  log  amplitude  and  phase  from  the  exponent  , ted  Born 
series  (EBS) 

However,  asymptotic  equivalence  may  not  correspond  to  a  practical 
equivalence  between  the  two  approaches  as  shown  in  the  previous  simple 
example  Keller  [1969;  showed  that  the  Born  and  the  Rytov 
approximations  are  equally  accurate  in  their  dependence  on  the  small 
velocity  parameter.  €  .  but  show  quite  different  asymptotic  properties  as  a 
function  of  range  Mueller  et.  al  [1979]  noted  that  both  the  Born  and  'he 
Rytov  approximations  required  that  some  norm  of  the  velocity 
perturbations  over  the  volume  be  small  compared  to  unity.  In  addition,  the 
Born  approximation  required  small  lea  ,  where  a  is  the  scale  of  the 
heterogeneity,  in  agreement  with  the  range  of  validity  for  the  single 
scattering  formulation  given  in  Appendix  1.  Thus  .ow  spatial  wavenumber, 
band  limited  velocity  perturbations  appears  to  favor  the  Rytov 
approximation  From  the  context  of  an  inverse  problem,  it  is  natural  to 
estimate  the  low  spatial  wavenumbers  of  the  velocity  first  via  a  multiple 
scale  type  of  procedure.  For  small  scale  residual  heterogeneities,  '.he  Born 
approximation  to  the  field  may  have  an  advantage. 

Experiments  in  diffraction  tomography  comparing  the  Born  and  the 
Rytov  formulations  have  been  conducted  by  Kaveh  et  al.  t  i. 982, L 983 ^  who 
showed  that  when  a  ,arge  forward  scattering  component  exists,  the 
linearized  reconstructed  images  based  on  the  Born  approximation  were 
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more  in  error  than  the  Rytov,  but  the  Rytov  required  the  determination  of 
the  phase  of  the  scattered  waves.  There  are  numerical  difficulties  in 
obtaining  the  log  of  the  field  since  the  phase  of  the  complex  log  must  be 
unwrapped.  Still,  numerical  algorithms  for  this  exist  (see  Tribolet  [1970]). 
Also,  algorithms  based  on  the  smoothness  of  the  phase  at  adjacent  spatial 
points  have  had  some  initial  success  (see  Kaveh  et.al.  [1902],  Kaveh  et.al. 
[1904  ). 

Another  consideration  as  to  whether  to  use  a  linearization  ba«ed  on 
the  Bom  (field)  or  the  Rytov  (log  of  the  field)  approximations  is  the  noise 
structure  of  the  problem  (see  Tatarskii  [l97i]).  If  the  data  is  expressed  as 
a  sum  of  signals  plus  independent  Gaussian  noise,  .V  ,  then  the  linearized 
inversion  should  be  applied  to  the  waveform  or  to  fhe  real  and  imaginary 
Fourier  components.  !f  the  noise  structure  .s  multiplicative  and 
proportional  to  the  signal,  then  log  <V  wiii  behave  in  a  Gaussian  manner 
around  log  u  In  this  case,  the  log  should  be  taker.  ('see  Aki  and  Richards 
[1900],  pp  639)  Detailed  work  on  the  effects  of  additive  and  multiplicative 
noise  for  moment  tensor  tnversion  of  surface  wave  data  was  done  by 
Patton  and  Aki  [  1 979 ] 

If  the  noise  is  Gaussian  and  additive  in  the  field,  then  the  fluctuations 
in  the  amplitude  should  follow  a  Rayleigh  distribution.  If  the  noise  is 
Gaussian  and  additive  in  the  complex  phase,  then  the  fluctuations  m  the 
amplitude  should  follow  a  lognormal  distribution.  Experiments  of  light 
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propagation  in  the  atmosphere  show  closer  agreement  with  the  lognormal 
distribution  (see  Tatarskii  [l97lj). 


In  order  to  check  the  linearization  in  terms  of  the  log  field  for  a  large 
scale  heterogeneity,  a  homogeneous  velocity  perturbation.  6v  ,  ts  agam 
considered.  Equation  (35)  can  then  be  written 


fy{rg.rt ,) 
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'MR,  r-  *R.- 

i _ 3  C _ 
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Using  the  Fresnel  approximation  with  z  axis  oriented  from  the  source  to 
receiver 


fy(rg ,rs) 


^Jvz_Xf  ,  dz! . 
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Using  the  relation 


=  (7t/  a)**  .  then 


frp{Tg.T,)  = 


^  _  -cj2(5t>z  r  dz'  277 z’fz-z') 

*  '  (  7  —2  ' i 
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Of 


6v(rg,r,)  = 


-i  cjz  <5t> 
v02 


Now  let  6i(r  -  =  tuzf 


J - Llsj  ,  Thus,  the  linearization  in 

v,  v0  ”o 


terms  of  leg  variables  will  retrieve  a  global  velocity  change  in  one  iteration 
without  the  apparent  low  frequency  requirement  of  the  Born 
approximation  of  equation  (7). 
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There  are  several  basic  linearizations  that  can  be  used  in  inversion 
studies  including  those  based  on  the  additive  nature  of  the  field.  Born,  or 
the  log  field,  Rytov  Travel  time  inversions  are  more  similar  to  the  Rytov 
approximation  except  that  only  the  imaginary  part  of  the  complex  phase  is 
utilized  and  usually  only  first  arrival  times.  Keller  [1989]  noted  that  any 
advantages  that  the  Rytov  approximation  may  have  over  the  Born 
approximation  may  be  lost  when  the  waveform  contains  more  than  one 
wave  This  would  occur  when  there  is  high  wavenumber  velocity 
fluctuations  r  \using  significant  backscatter.  This  may  be  the  reason  for 
the  success  of  simple  migration  algorithms  which  map  small  scale 
reflectors  and  which  are  kinematically  equivalent  to  Born  inversion  (see 
Miller  et  al  [1984])  Another  consideration  as  to  whether  to  use  the  ing 
field  data,  or  the  amplitude  and  phase,  is  the  numerical  difficulties  in 
unwrapping  the  phase  whin  taking  the  complex  log,  although  work  on  th>s 
has  been  done  (  Triboiet  [1978].  Kaveh  [  1 982. 1 984  j) 

One  strategy  for  inversion  of  material  parameters  is  to  use  a  multiple 

l  J  . 

scale  procedure  in  which  the  large  scale  fuctuation  are  inverted  :or  firs*., 
and  compensated  for.  This  might  include  several  Rytov  iterations  followed 
by  a  series  of  Born  iterations.  A  compromise  approach  would  be  to  use  an 
iterative  travel  time  inversion  first  to  adjust  for  the  major  phase 
diffierences  and  then  use  the  Iterative  Born  Inversion  on  the  waveforms  to 
incorporate  more  of  the  data  and  fine  tunc  the  model,  in  the  following 

numerical  examples.  The  Born  l  nearization  is  used  on  res,duul  waveform 
data  assuming  small  velocity  perturbations. 


2.4.4  EXAMPLES 


In  this  section,  several  examples  are  given  using  a  linearization  in 
terms  of  the  fleld  as  described  previously.  A  simple  descent  method  is  used 


Vk+i  =  vk  +  aWk(Fktpk) 


(36) 


where  Fk  -  C^F^Cp  ,  Wk  is  a  preconditioner,  and  a  is  a  scaling  factor. 
For  an  initial  homogeneous  model  close  to  the  true  model,  then  Fk  *  Fq  . 
Operating  on  the  residuals  by  the  adjoint  is  then  given  in  (29)  as 


<517*  -  Fk6pk  * 


^  -SSi 


[Kr.Kr, 


Hrr+Rrr 

tp(rg,L  =  —! 


where  <5^  is  the  cross  correlation  of  the  seconc  derivative  of  the  source 
time  function  with  the  weighted  data  residuals,  Cv  is  the  model 
covariance  operator  and  includes  a  spatial  smoothing  function.  Here,  a 
Gaussian  smoothing  function  as  in  (21)  is  used.  Operating  on  the  residuals 
by  the  adjoint  is  kinematically  similar  to  a  KirchhofT  migration.  If  the 
preconditioner,  Wk  ,  is  chosen  as  /  ,  then  the  velocity  inversion  in  (36) 
with  Fk  =  Fq  is  equivalent  to  a  sequence  of  KirchhofT  migrations.  An 
alternative  choice  for  the  preconditioner,  Wk  ,  is  the  inverse  of  the 
diagonal  of  (/  +  F* F)  as  in  (32).  The  part  oi  this  preconditioner  that 
approximately  corrects  for  geometric  spreading  is 


Wk  = 


EE 


-l 


(3  7) 
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This  simple  preconditioner  will  be  used  in^ddition  to  the  identity  in  the 
next  example. 

The  geometry  of  the  first  example  is  shown  in  Figure  A.  There  are  nine 
sources  marked  by  x's  and  nineteen  receivers  marked  by  triangles.  The 
axes  are  in  km  to  give  specific  units  and  the  initial  velocity  is  8  km/sec. 
For  this  example,  a  single  smooth  heterogeneity  with  a  radius  of  10  km  is 
reduced  in  velocity  by  -.01  km/sec.  This  is  shown  by  the  circle  in  Figure  4. 
The  interior  box  in  Figure  4  shpws  the  region  where  the  velocity  is  to  be 
inferred. 

Figure  5  shows  the  seismograms  computed  using  the  Gaussian  beam 
method  for  the  single  smooth  heterogeneity  shown  in  Figure  4  for  three  of 
the  nine  source  locations.  The  source  wavelet  is  the  Gabor  wavelet 

f(t)  =  e~(‘2nfst/7^  cos(27t/o*  +  Vo) 

where  /0  =  4  hz,  y  =  3.  <p0  =  0  .  This  results  in  a  2  km  wavelength.  The 
correction  given  by  Cerveny  et.al.  [1982]  is  used  to  approximately  compute 
the  3-D  response.  In  Figure  5,  a  slightly  larger  amplitude  can  be  seen  on 
some  seismograms  for  different  source  locations.  For  example,  for 
S  =  (30.,  140.)  ,  the  amplitude  increases  due  to  focusing  at  a  range  of  180 
km  and  a  time  of  24  sec. 

The  residual,  perturbed  minus  unpe*  l,irbed,  seismograms  are  shown  in 
Figure  6  for  three  source  locations.-  The  amplitudes  of  the  residuals  are 
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about  ten  times  smaller  than  the  amplitudes  of  the  unperturbed 
seismograms  This  satisfies  the  Born  requirement  that  the  scattered  held 
be  small  compared  to  the  primary  held.  The  wavelet  cross  correlation  with 
the  data  results  in  a  symmetric  wavelet  centered  on  the  arrival  time,  and 
the  second  derivative  approximately  results  in  a  factor,  -v*  .  assuming  a 
Cabor  wavelet. 

Figure  7  shows  the  result  of  back  propagating  the  residuals  into  model 
space  using  the  adjoint  operator.  This  would  be  the  first  iteration  in  a 
descent  procedure  with  Wk  =  I  and  Cv  =  av6{f  -?’)  .  A  grid  of  100  by  100 
points  has  been  evaluated  in  model  space.  The  result  in  Figure  7  is  scaled 
from  -l.  to  +1..  The  contours  are  at  (-.75,  -.25,  .25,  .75)  with  the  -  75 
contour  in  the  center  The  parallelogram  and  the  small  plus  represent  the 
location  of  the  true  heterogeneity  The  data  residuals  appear  to  be 
streaked  along  lines  from  the  receivers  to  the  stations  in  model  space.  The 
response  to  the  simple  transpose  operator  for  a  single  heterogeneity  thus 
has  a  streaked  appearance  In  model  space. 

There  are  additional  operations  that  can  be  included  in  an  individual 
velocity  inversion  step.  This  includes  a  more  involved  preccnditioner  as  in 
equation  (37)  which  approximately  corrects  for  geometric  spreading,  and 
is  an  approximation  to  filtering  the  backprojection.  Figure  8  shows  the 
result  of  using  this  preconditioner.  It  has  the  effect  of  reducing  some  of 
the  streaks  in  the  baoKprojection.  Again,  the  true  heterogeneity  is 
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Figure  6  the  first  iteration  cf  a  velocity  inversion  for  a  s 
heterogeneity  with  H*  *iven  by  eqfi.  $7)  and  t;  =  dv6{? 
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represented  by  the  parallelogram. 


Another  operation  that  can  be  app'ied  to  the  backprojeotior.  is  spnti&l 
smoothing  by  the  model  covariance  operator  based  on  a  prion  informe. Uon. 
Figure  9  shows  the  result  of  smoothing  the  backprojection  by  a  Gaussian 
smoothing  filter  with  a  5  km  radius  This  filtering  operation  d’shiristes  Jae 
strenks  and  localizes  the  heterogeneity.  This  heterogeneity  's  slightly 
elongated  in  the  z  direction  compared  to  t.ne  true  heterogeneity 
represented  by  the  parallelogram.  This  results  from  sources  below  and 
receivers  above  the  heterogeneous  region. 

To  check  the  sign  of  the  derived  heterogeneity,  the  example  was 
redone  with  a  positive  instead  of  negative  velocity  heterogeneity,  and  the 
inferred  velocity  was  also  positive  A  final  step  in  an  teration  is  the  choice 
of  the  scalar,  a  ,  which  specifies  how  far  in  a  particular  descent  direction 
one  moves  A  particular  value  for  the  scaling  factor  can  be  computed  from 
equation  ( 16).  although  other  values  can  be  used  to  over  or  urder  relax 
the  problem.  The  scaling  factor  gotten  to  match  the  'ingle  hcierogeneity 
case  will  be  used  to  relax  each  iteration  step  in  the  next  examples 

In  the  next  example,  three  smooth  heterogeneities  are  used,  each  with 
a  velocity  hetorogeneitv  of  -.01  km/ sec  lower  than  the  background  The 
geometry  is  shown  in  Figure  10  The  sources  are  below  ti  e  hete-ogeno:ty 
and  the  stations  aoove  The  result  of  backprojectmg  the  fcsiduals  into 
model  space  and  applying  a  4  km  Gaussian  covariance  operator  is  shown  ih 
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Figure  11,  The  contour  interval  goes  from  -1.  to  +1.,  and  the  contours  are 
at  (-.75,  -.25,  .25,  .75).  The  true  locations  of  the  heterogeneities  are 
represented  by  the  parallelograms  Using  the  scaling  factor  gotten  from 
the  single  heterogeneity  case,  then  the  lower  heterogeneity  has  a  velocity 
perturbation  of  -.0109  which  is  approximately  accounted  for  in  this 
iteration.  The  upper  two  heterogeneities  have  about  half  that 
perturbation. 


figure  12  shows  the  second  iteration  result  of  backprojecting  and 
smoothing  with  a  4  km  Gaussian  model  covariance  operator  The  upper 
two  heterogeneities  can  be  seen  along  with  side  lobes.  Using  the  scaling 
factor  gotten  from  the  single  heterogeneity  case,  the  upper  two 
heterogeneities  have  a  velocity  perturbation  of  •  0052  km/sec  When 
combined  with  the  perturbation  frcm  the  first  ite-aticn.  then  all  thrae 
heterogeneities  have  been  accounted  for  in  the  invemion. 


In  the  next  example,  a  ring  heterogeneity  is  consider >.d  The  model 
geometry  is  showr  in  Figure  13  This  is  .an  idealization  of  the  structure 
derived  by  Thurber  [19P4*  under  the  Kilauea  volcano  using  a  t:r  e!  time 
inversion.  He  found  a  high  velocity  core  of  the  volcano  surrounding  an 
interior  lower  velocity  region,  which  was  inferred  to  de  a  magma  complex 
For  tr  f  numerical  example  just  the  ling  is  reduced  in  *c«* y  b,  •  * ; 
km.'aec  Tig-rs  1  1  show  .he  result  ..f  smoo&m.g  the  u  ,c’  p:„, t-ct.jr.  .  th  a 


Gaussian  model  covanurue  operator  with  a  4  km  radius  The  contours  go 


Figure  11 


Figure  11  The  first  iteration  in  a  velocity  inversion  f  -r  ree  smooth 
heterogeneities  with  Wk  -  I  and  Cy  given  spatially  by  a  Gaussian 
function  with  a  4  km  radius. 
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from  -1.  to  +1.  with  a  .25  contour  interval.  The  velocity  perturbation  forms 
a  ring  with  the  largest  pertubations  on  the  sides  of  the  ring  Using  a 
scaling  factor  from  the  single  heterogeneity  case  gives  an  amplitude  of  -.0? 
on  the  sides  of  the  ring,  or  twice  as  large  as  the  true  heterogeneity. 
Smaller  perturbations  form  the  the  tope  and  bottom  of  the  ring.  In  order 
to  examine  the  next  iteration,  a  scal.ng  factor  of  half  of  this  is  used  so  as 
to  compensate  for  the  sides  of  the  ring  The  bottom  of  the  ring  is  half 
compensated  for.  The  backprojection  for  the  second  iteration  with  a  4  km 
Gaussian  covariance  operator  applied  is  shown  in  Figure  15.  Using  the 
scaling  factor  for  the  single  heterogeneity  for  this  iteration  gives  a  velocity 
of  -.011,  which  approximately  compensates  for  the  top  and  bottom  of  the 
ring. 

When  applying  this  iteration  procedure  to  actual  data,  the  scaling 
factor,  or  ,  must  be  approximately  derived  using  equation  (13),  or  an  eve” 
relaxed  version  of  it.  In  these  numerical  examples,  only  a  rough  value  was 
t..->ed  to  see  how  convergence  proceeds.  Nonetheless,  choice  o:  this  ocaling 
factor  is  essential  part  of  the  prodem. 

Finally,  a  simple  comparison  of  these  results  with  travel  time 
inversions  using  the  same  model  geometries  is  done  The  modei  s  divided 
into  10  by  !0  blocks  or  '00  modei  parameters.  Ui  travel  times  ■"•m  u~e  :, 
not  a!i  f  which  give  ,-ei\  infermatioft  about  the  modei.  A  damped  n  version 
is  used  The  result  for  a  single  heterogeneity  is  shown  m  Figure  16a  Using 
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jure  15  The  second  iteration  for  a  velocity  inversion  for  a  r: 
heterogeneity  with  Wk  and  given  spatially  by  a  Gaussian  with  a 
km  radius. 


a  damping  of  .001.  a  velocity  of  -0102  km/sec  is  gotten.  This  result 
compares  with  the  waveform  inversion  shown  in  Figure  8.  The  travel  time 
inversion  tor  three  heterogeneities  is  shown  in  Figure  16b.  This  is  agam 
similar  to  the  two  iteration  waveform  result.  Finally,  the  travel  time 
inversion  result  for  the  ring  structure  is  shewn  in  Figure  16c.  In  this  case 
the  waveform  result  may  do  slightly  better.  As  discussed  in  a  previous 
section,  tor  large  soale  heterogeneities,  a  complex  phase  linearization  may 
be  more  robust.  Travel  time  inversions  are  in  this  class,  m  whicn  only  he 
imaginary  part  of  the  phase  is  used  with  first  arrivals.  However,  for  small 
velocity  perturbations  the  results  are  comparable.  In  actual  cases, 
waveform  inversions  allow  for  a  more  complete  use  of  the  data  when  the 
phase  perturbations  are  small,  or  have  been  previously  compensated  for. 
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2.4.5  CONCLUSIONS 

In  this  paper,  a  linearized  inversion  procedure  for  material  parameters 
has  been  investigated.  !n  this  procedure  a  linear  sensitivity  operator  must 
be  derived  and  can  be  computed  economically  by  using  reciprocity  of  the 
Green’s  function.  Data  errors  and  a  prior  model  information  are  included 
via  covariance  operators.  Large  matrix  inversions  are  avoided  by  using 
descent  algorithms.  A  fast  forward  modelling  scheme  is  required  and  here 
the  Gaussian  beam  method  for  a  laterally  varying  medium  is  used.  This 
allows  for  inhomogeneous  initial  models. 

Different  linearizations  are  possible  including  a  linearization  in  terms 
of  the  field,  the  Born  approximation,  and  a  linearization  in  terms  of  the  log 
field,  the  Rytov  approximation.  Travel  time  inversions  are  in  the  class  of 
linearizations  for  the  log  held,  where  only  the  imaginary  part  of  the 
complex  phase  is  used  with  first  arrivals.  Log  field  linearizations  may  be 
more  robust  than  field  linearizations  for  large  scale  heterogeneities  where 
forward  scattering  predominates,  but  phase  unwrapping  may  be  difficult 
numerically.  The  linearization  in  terms  of  the  field  for  small  perturbations 
from  a  homogeneous  background  is  kinematically  equivalent  to  a  sequence 
of  Kirchhoff  migrations.  Field  linearizations  are  expected  to  be  useful  for 
small  scale  heterogeneities  which  result  in  scattering  effects  that  are 
additive  in  the  field.  Several  numerical  examples  are  performed  using  a 
field  linearization  which  identify  the  trial  structures.  A  comparison  is  then 
done  with  travel  time  inversion  results. 
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2.4.6  APPENDIX  1 

In  this  appendix  a  brief  review  is  given  for  the  conditions  of  validity  of 
the  Born  approximation  in  a  continuous  fluctuating  medium  characterized 
by  a  spatial  Gaussian  correlation  function  where 


where  M(r)  =  -<5t/(r)/v(r)  ,  and  £|  •  •  •  j  represents  the  expectation  over 
an  ensemble  of  random  media,  and  a  is  the  correlation  length,  if  the 
process  is  ergodic  then  this  expectation  can  be  replaced  by  a  spatial 
average  over  one  realization.  The  fractional  loss  of  energy  to  the  primary 
wave  for  a  medium  with  velocities  with  a  Gaussiar  correlation  function  is 
(see  Aki  and  Richards  [1980|) 

=  VrfE^x2jj:aA:£(  l  -  e  ■*“*"] 

For  the  Born  approximation  to  be  vaiid  then  ~  «  \  .  or 

«  l  fca  »  l 

Vrrf£^yU,2]ifc3ct3A:Z,  «  1  fea  «  L 

where  L  is  the  propagation  distance  and  a  is  the  correlation  distance. 
Thus,  ka  »  l  refers  to  large  scale  scatterers  and  ka  «  |  refers  to  small 
scale  scattcrcrs  with  respect  to  the  wavelength,  ka  «  l  correpsonds  to 
Rayleigh  scattenng.  For  large  scale  scatterers,  ka  »  1  ,  then  the  mean 


square  velocity  variation  must  be  small  for  the  strict  validity  of  the  Born 
approximation.  For'  small  scale  seatterers.  relatively  large  mean  square 
velocity  fluctuations  can  be  accomodated  by  the  Born  approximation. 

Thus 


ka  =  10. 
ka  =  2.0 
ka  -  0.5 
ka  =0.1 


kL  «  .056 
ki  «  .280 
kL  «  4.50 
kL  «  564. 


For  a  given  propagation  distance.  L  ,  linearizations  based  on  the  Born 
approximation  will  favor  small  scale  fluctuations. 


'/v  v  .*»  v»  <  •  v  f>  v  r  j  r/  v  v  /  v  w  /v  ■/  v  v  •> 


2.4,7  APPEND rX  2 


In  this  appendix  a  comparison  is  done  between  perturbation  solutions 
which  to  first  order  are  addititive  in  the  complex  phase  or  multiplicative  in 
the  field.  These  include  the  first  order  geometric  optics  approximation,  the 
parabolic  equation  method,  and  the  Rytov  approximation.  These  types  of 
approximations  work  best  in  a  smoothly  varying  media  with  predominantly 
forward  scattering  First,  the  geometric  optics  approximation  is  outlined 
and  compared  with  the  standard  parabolic  approximation.  These  are  then 
compared  to  the  Rytov  approximation. 


Starting  with  the  Helmholtz  equation,  the  first  order  geometric  optics 
approximation  has  the  following  general  trial  solution 

+  (.4.2.1) 

where  £  is  a  large  parameter  and  v’ofs,Vi-  In  this  approximation  k  is 

usually  identified  with  an  average  wavenumber  and  large  ic  implies  a  small 

wavelength.  By  substituting  this  into  the  Helmholtz  equation  then 


Ic2  V<pQV<p0  +  k2/  Ic2 
+  Ic  V2?0  +  2V<PqV<Pi 
+  |  +  V^V  <px\  =  0 


(.4.2.2) 


where  k-u/v{£)  The  particular  trial  solution  usually  used  for 
geometric  optics  is  u  =.4(x)e^5^^  where  k  =cj/<v>.  Thus  >p0=iS(x) 
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and  <px  =  liL4(f)  and  (A.2.2)  can  then  be  written  as 

+k*/£*  (A.2.3) 

+  2t£j|-V2S  +  V5Vliu4 

+  ^VZA/A  =  0 

For  large  fc  ,  the  first  geometric  optics  solution  involves  dropping  the 

V2A/ A  term.  The  Eikonal  equation  results  from  equating  the  first 

bracket  to  zero  and  the  transport  equation  results  from  equating  the 
second  bracket  to  zero.  The  tansport  equation  relates  the  the  imaginary 
part  of  the  phase  to  the  real  part  of  the  phase. 

The  standard  parabolic  equation  method  is  derived  by  using  for 
(A2.2.1)  the  particular  trial  solution  u  -  where  U  is  assumed  to  be 

a  slowly  varying  function,  then  < =  ix  and  <pi  =  In U(2)  .  (A.2.2)  can  then 
be  written 


For  large  £  ,  the  Eikonal  equation  give  k  =  £  ,  and  the  transport  equation 
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gives  Uj  =  0  .  The  parabolic  approximation  results  from  neglecting  the 
U a/  U  in  the  Anal  bracket  but  retaining  the  transverse  part  of  the 
Laplacian,  V|„p  =  UtVV  +  Ut„  .  This  is  not  entirely  consistent  within  the 
perturbation  scheme  used  but  it  gives  a  one  way  operator  in  x  , 


»  ' 

2 i£ux  +  Vpgrp  U  +  k2-lcz  U  =  0 


(A.2.5) 


The  transverse  Laplacian  is  called  the  diffraction  term  and  the  term 
kz-&  is  called  the  thin  lens  term  and  locally  adjusts  for  the  refractive 


index. 


The  Rytov  approximation  results  by  using  the  trial  solution 


u  =«**** 


(A. 2  6) 


where  e  is  a  sma.l  parameter,  and  %  *  Y'i  •  The  velocity  is  also  expanded 
as  =  v0(i)  +  nij(f )  ,  where  v0«v,.  Substituting  this  into  the 

Helmholtz  equation  gives 


V2^0  +  (V^o)2  +  cj2/v|(f) 


+  ejv2^,  +  2V^0V^j  -  2cj2Vj(i)/  vq(i' 


(A.2.7) 


+  f2  (V^j)2  +  3cj2v2  {£)/  Vq  (£)|  =  0 


where  the  first  term  in  the  brackets  is  a  solution  of  the  Helmholtz  equation 
with  v  =v0t£)  and  u  =  ef:  In  the  first  Rytov  approximation,  the  t2 


term  is  dropped.  Equating  the  second  term  to  zero  gives 

V2^  +  2V^0V^!  =  2v  ju2/  v$  (A  2.8) 

In  order  to  compare  with  the  previous  approximations  let  %  =  lc<p0  and 
y  =  tpx/e  =  lccpl  ,  then  (A. 2.7)  can  be  written 


+  (^,+£2^0^.  + ^^4 

\k  VQ 


(A.2.9) 


where  fc  =  u/  <v0{2)>  ,  k q  =  cj/vq(2)  ,  and  k  «  fco  •  The  Rytov 
approximation  again  retains  the  first  two  brackets,  where  the  first  bracket 
is  assumed  known.  The  £2  terms  relates  to  the  Eikonal  equation  in 
equation  (A.2.2),  only  now  related  to  the  velocity  model  v0{x)  The  k 
terms  correspond  to  the  transport  equation  in  (A.2.2)  plus  a  correction 
term  to  the  velocity  linear  in  Vj(f)  .  Finally,  the  term  V2^,  is  retained, 
but  not  the  term  (V?j)2  and  the  higher  order  terms  containing  the 
velocity  correction  U](x)  .  This  is  similar  to  the  parabolic  approximation 
where  again  only  a  part  of  the  third  bracket  in  (A.2.2)  is  retained.  In  that 
case  the  transverse  Laplacian  perpendicular  to  the  predominant  direction 
of  the  incident  wave  is  retained.  For  example  if  we  iet  vQ  be  a 
homogeneous  velocity,  and  V'o  =  ^  -  then  by  dropping  terms  in  e2  ,  (A. 2. 7) 
can  be  written  in  terms  of  U  =  e**1 


U  =  0 


2ileUx  + 


v2c/.iv^ij_!£^ 

u  \  vg 


This  is  similar  to  the  parabolic  equation  given  in  (A.2.5)  with  the  thin  lens 
term  approximated  to  first  order  in  ev,(f)  and  the  term  U  replaced 
by  V2  U-(VU)2/  U  .  Using  the  Transverse  Laplacian  results  in  a  simpler 
equation  but  the  Rytov  equation  above  is  consistent  to  order  c  . 


2.4.8  APPENDIX  3 


In  this  appendix,  the  acoustic  wave  equation  is  investigated  in  a  simple 
1-D  example,  where 


L  = 


K{r)  dt2 


The  transposed  linear  sensitivity  operators  for  a  perturbation  in  density 
and  bulk  modulus  from  the  held  are  given  inTarantola  [1984a]  where 


6Kk  =  U£&p  = 


'JrrZ'Lfdt  P(r-i-rs)  09(r.t:rg)  }<5p(r?,*.rs) 


<5 Pk  ~  V&P  ~ 


~4~r  ZZftt 

P  (r)  r,r, 


Vp  (r,t  :rg)  •9g{r,t\rg)  j  op(rg,t,rM) 


or 


&K  =  TTrrfdt 


Eg(r.-*V'  *  6p  (rg,t  rs/J  |A/J  y 


<5 Pk  =  -TTT  fdt  2  7P(r-^r */ 

Pz(r) 


^  Vg{r-t\rg}  •  6p$rg  ,t  ,r%) 


(A.3.2) 


In  the  l-D  case,  a  coincident  source  and  receiver  are  considered  where 
rg  -  r,  =  z  =  0  Small  perturbations  of  a  homogeneous  starting  model  to 
the  true  model  will  be  assumed.  The  observed  reflection  seismogram  is 

P0  «wiV*:rs)  =  ^o«w(0 


"AWV^V  i  K  ,*l  >  //A  V  V  V  ^  .VtrjrtMrw^  V*  w  5  *  j*  *  *  j»  »  r<*  r  j*  j*  *  ^  v 


and  the  predicted  seismogram  is 


Pk(rg,t:r '»)  -  pk( 0 
The  differential  seismogram  is  then 

6pk(rg,t:rs)  =  Poka{t)  - Pk{t ) 

The  operation  in  (A. 3.1)  can  then  be  written 

<5A*  =  g1  --fdt  pk(r,t;rs)  <g(z  ,t;0)6pk{t)> 

Afc  (r , 


where  <  >  denotes  a  cross  correlation.  The  analogous  operation  on 

5p'k  in  (A.3.2)  can  be  written 


6pk  =  fdt  j-pfc(z.r.O)  -£-<<7(z,f:0)<5pfc(O> 
Pf(z)  az  az 

W,th  an  mtial  homogeneous  model  with  velocity  v  nen 

g(z  ,t . 0)  =  <5(£  ~  z/v) 

and 


Then 


<g  (z  ,t  ,0)6pk(t  )>  -  6pk{t  +  z/v) 


<5Kk(z)  =  ^  fdt-^pk{z.t.0)5pk{t  +  z/v) 

At  (Z  ) 


dt 


6pk{z) 


Jjfcr  + 


Now  with  Pk(z  3'  =  <5(£  -  z / v)  then 


<s*»  =  7r-r5P»(‘ 

A*  (z) 


2z/v) 


.vwu  vi'/lva  \  a '.vvvi\  S.V.TX  » .tv »  •. w.Ti <r. a r.aoarijrwfitfv 


where  :  -Zz/v  places  the  model  perturbation  at  z  -vt/Z  Since 
v  -  V K/ p  and  77  =  pv  =  impedance,  then 

6v_  \§K_  §£_ 

v  “|  K  p 

5t)  _  5K_  5p 

7}  ~\K  p 

Including  the  model  covariances  this  can  be  written  assuming, 

CK  *  o£  s*  x£{z ) 

-2 

6Kk{z)  -  C^^z)  «  -£—6p(Zz/v)  *6p(Zz/v) 

Xiciz) 

and  with  *  pk{z)  . 

a  2  , 

6  pk  -  Cp5pk  *  P  ip  {Zz/v)  =»  -y<5p(2z/u) 

Perturbations  in  velocity  and  impedance  can  then  be  written 

—  =  4-,5P(22/v)  =  ° 

v  2  a  v  *p 

and 

^  ^  4-  ^<5p(2z/v)  =  l4p(2*/u) 

Thus  for  the  l-D  rejection  seimogram  modelled  using  the  acoustic  wave 
equation,  there  is  no  velocity  perturbation,  but  there  is  an  impedance 
perturbation.  This  is  as  expected  from  the  l-D  exact  inversion  results. 
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3.  DIRECT  MEASUREMENT  OF  THE  MANTLE  ATTENUATION  OPERATOR 
FROM  BROADBAND  P  AND  S  WAVEFORMS 
3.1  INTRODUCTION 

Compelling  evidence  for  the  existence  of  frequency  dependent  Q  has  come  primarily 
from  comparisons  of  amplitude  ratios  of  body  waves  recorded  by  instruments  with  widely 
separated  and,  often,  narrow  frequency  bands  (e.g.,  Gutenberg,  1958;  Solomon,  1972;  Der 
and  McElfresh,  1977;  Sipkin  and  Jordan,  1979;  Lundquist  and  Cormier,  1980).  It  is  now 
widely  accepted  that  the  incongruity  posed  by  attenuation  models  derived  for  different 
frequency  bands  can  not  be  resolved  without  forsaking  the  assumption  of  frequency  in¬ 
dependent  attenuation  (e.g.,  see  the  review  by  Cormier,  1982).  Until  recently,  studies 
of  attenuation  over  a  broad  and  continuous  range  of  frequencies,  say,  from  several  Hz  to 
tens  of  seconds  were  few  (among  them  Uleg  and  Berckhemer,  1984;  Der  et  al,  1985).  A 
major  impediment  to  broadband  studies  has  been  that  conventional  seismograph  systems, 
such  as  that  of  the  World-Wide  Standardized  Seismograph  Network,  were  usually  designed 
to  filter  out  data  in  the  midband  range  of  seismic  frequencies  (about  0. 1-1.0  Hz).  With 
recent  improvements  in  digital  instrumentation  and  increasing  availability  of  broadband 
recordings,  it  becomes  imperative  that  the  behavior  of  intrinsic  attenuation  in  the  earth 
be  understood  across  a  broad  and  continuous  spectrum  of  frequencies  which  encompasses 
the  midband  range.  Clearly  our  ability  to  infer  physical  properties  and  processes  in  the 
earth  can  only  be  as  complete  as  our  understanding  of  geophysical  parameters  such  as 
attenuation.  More  importantly,  a  correct  description  of  body-wave  attenuation  is  critical 
if  studies  of  earth  structure  and  source  mechanism  are  to  be  conducted  properly.  The 
choice  of  an  attenuation  operator  in  the  computation  of  synthetic  seismograms  or  for  the 
correction  of  propagation  effects  in  data  can  inordinately  influence  the  interpretation  of 
the  rupture  process  of  earthquakes  (Hanks.  1981). 

A  new  method  of  measuring  the  frequency  dependence  of  Q  is  presented. 

The  method  consists  of  matching  details  of  displacement  and  velocity 

records  of  broadband  P-  and  SH-type  body  waves  with  synthetic  body  waves.  The  fre¬ 
quency  dependent  effects  of  a  finite  source  are  incorporated  in  the  synthetics  by  using  a 
source  theory  that  provides  a  realistic  and  causal  description  of  rupture  and  healing  for 
simple  earthquakes.  The  effects  of  attenuation  are  incorporated  by  using  dispersive  op¬ 
erators  conforming  to  the  Kramers-Kronig  relations.  The  conditions  imposed  by  causai 
rupture  and  causal  attenuation  predict  phase  spectra  that  can  behave  in  such  radically  dis¬ 
similar  fashion  that  the  tradeoff  between  source-time  function  and  attenuation  is  strongly 
minimized.  It  is  possible  to  observe  any  antithetical  behavior  in  the  phase  spectra  onlv 
with  waveforms  that  contain  information  over  a  broad  and  continuous  frequency  range 
At  the  same  time,  the  modeling  technique  is  consistent  with  assumptions  about  source  hi  - 
havior  that  are  commonly  u«ed  b\  attenuation  studies  that  model  the  asymptotic  behavior 
of  amplitude  spectra  at  high  and  low  frequencies.  First,  some  general  characteristics  of  broadl  and 

displacement  and  velocity  waveforms  are  described,  followed  by  a  description  of  the  technique  used 
to  generate  theoretical  seismograms.  Afterwards,  the  method  is  applied  to  the  analysis  of  P  and 
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SII  waveforms  senerated  by  a  Jeep  earthquake  that  occurred  in  the  Sea  of  Okhotsk.  Finally,  there 
is  a  discussion  of  the  implications  of  the  final  range  of  Q*,  and  Qj  models  on  the  mechanisms  of 
Tt.t.ennafion  in  the  earth. 

3.2  CHARACTERISTICS  OF  BROADBAND  WAVEFORMS 

The  data  analysed  in  this  paper  are  taken  from  digitally  recording  stations  of  the 
GDSN  and  RSTN  networks  and  the  GRF  array.  Operating  characteristics  of  the  seismo¬ 
graphs  may  be  found  in  Ganse  and  Hutt  (1982),  Engdahl  et  al.  (1982)  and  Harjes  and  Seidl 
(1978).  Except  for  that  of  GRF  array,  most  of  the  raw  data  are  not  directly  proportional  to 
either  displacement  or  velocity  over  a  wide  and  continuous  range  of  frequencies.  The  most 
common  seismograph  configuration  splits  an  incoming  signal  into  long-  and  short-period 
components.  Although  an  intermediate-period  component  is  recorded  by  the  RSTN  and 
many  GDSN  stations,  the  bandwidth  of  the  data  needs  to  be  augmented.  In  order  to 
obtain  broadband  records  of  displacement  and  velocity,  we  use  the  method  described  by 
Harvey  and  Choy  (1982)  which  recombines  data  recorded  on  separate  recording  channels 
by  a  multichannel  instrument  deconvolution.  It  is  the  high  dynamic  range  of  the  seismo¬ 
graphs  and  the  digital  format  of  the  data  which  enable  spectral  information  over  a  broad 
frequency  range  to  be  easily  retrieved.  The  accuracy  of  the  multichannel  deconvolution 
has  been  checked  by  a  comparison  of  the  deconvolved  P  waves  obtained  from  the  short- 
and  long-period  channels  of  the  SRO  station  at  GRFO  with  the  P  wave  recorded  directly 
as  broadband  velocity  at  the  same  site  by  a  GRF  station  (Choy,  1981).  The  accuracy 
of  the  multichannel  deconvolution  has  been  further  checked  by  comparing  the  processed 
waveform  with  the  waveforms  recorded  on  an  intermediate-period  component  at  man' 
stations.  The  P  waves  in  this  paper  that  are  obtained  by  this  processing  generally  have 
a  flat  displacement  or  velocity  response  from  0.02  to  5  Hz;  the  S  waves  generally  have  a 
flat  response  to  displacement  or  velocity  from  .01  to  between  2. 0-5.0  Hz,  depending  on  the 
recording  instrument. 

Details  of  the  rupture  process  which  are  suppressed  in  narrowband  data  are  olten 
obvious  with  broadband  data.  Some  of  the  advantages  of  broadband  data  are  illustrated 
in  Figures  3. 2-3. 4.  Figure  3.2  shows  the  broadband  P  waves  that  were  obtained  for  a 
deep  event  that  occurred  20  April  19ts5  in  the  5ea  oi  Okhotsk,  i  ne  gioual  distribution  of  the 
digitally  recording  stations  about  the  epicenter  is  shown  in  Figures. 1.  That  the  earthquake 
ruptured  in  a  complex  fashion  is  immediately  obvious  from  the  pattern  of  double  pulses 
that  can  be  seen  in  the  displacement  records  at  all  stations.  The  coherence  of  the  complex 
rupture  history  across  the  suite  of  stations  is  even  more  evident  in  the  velocity  records  The 
doublet  in  displacement  becomes  the  first  two  peaks  and  tl  e  ensuing  trough  in  the  velocity 
records.  The  strong  coherence  of  these  features  is  not  surprising.  Velocity,  being  directh 
related  to  energy  radiated  through  the  focal  sphere,  is  more  sensitive  to  the  dynamics  of 
the  source  than  is  displacement.  This  obvious  complexity  of  rupture,  however,  would 
virtually  undetectable  in  conventional  narrowband  form.  Figure  3.3,  for  instance,  shows 
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Figure  3.2.  Broadband  P-waves  from  the  GDSN.  RSTN  and  GRF  array  for  the  deep  Sea 
of  Okhotsk  earthquake.  Polarities  of  all  first  motions  have  been  plotted  as  positive.  Data 
are  plotted  as  solid  lines;  synthetics  are  plotted  as  dashed  lines.  In  some  displacement 
records  (right  column),  a  double  pulse  is  very  evident.  In  other  displacement  records, 
the  complexity  is  seen  as  a  strong  inflection  in  displacement.  In  all  the  velocity  records 
(left  column),  the  complexity  of  rupture  is  seen  as  the  first  two  peaks  and  the  subsequent 
trough. 
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Figure  3.3.  Three  different  displacement  pulses  are  shown  at  the  left.  {Top)  The  actual 
P  wave  at  COL.  ( Middle )  The  synthetic  displacement  convolved  with  an  attenuation 
operator.  ( Bottom )  An  arbitrarily  different  source  operator  convolved  with  the  same 
attenuation  operator.  This  source  operator  has  the  same  total  duration  as  the  source 
operator  used  in  the  middle  example,  but  has  no  rupture  complexity.  The  corresponding 
responses  to  convolution  with  a  long-period  seismograph  response  are  shown  at  the  right. 
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Figure  3.4.  Comparison  of  actual  P-wave  displacement  pulse  (solid  line)  with  frequency- 
independent  attenuation  operators.  With  a('  of  1.0  s  (long  dashes),  it  is  possible  to  match 
the  slope  of  the  rise  time  in  the  second  subevent  in  the  actual  data.  But  such  attenuation 
would  be  unable  to  match  the  pulse  duration  and  it  would  lack  the  high  frequency  content 
that  is  shaping  the  pulse.  A  constant  t'  of  0.6  s  could  match  the  pulse  width  of  the 
subevent,  but  would  be  unable  to  match  either  the  initial  "lope  or  the  decay  history  of  the 
data. 
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three  different  displacement  records  and  the  result  of  convolving  them  with  a  long-period 
seismograph  response.  The  actual  broadband  displacement  from  COL  shows  the  double 
pulse  complexity.  The  second  trace  shows  the  best-fitting  synthetic  record  using  the  source 
and  attenuation  operators  derived  later  in  this  paper  for  COL.  The  last  trace  shows  a 
much  different  displacement  record  obtained  by  convolving  the  attenuation  operator  with 
an  arbitrarily  different  source  operator.  This  source  operator  has  the  same  total  pulse 
duration  as  the  previous  operator  but  assumes  a  single  simple  source  using  the  model  of 
Sato  and  Hirasawa  (1973).  The  corresponding  long-period  seismograms  are  apparently 
indistinguishable  from  each  other  (Figure  3.3b).  The  effect  of  the  instrument  response  has 
been  to  suppress  major  differences  between  the  source  operators  beyond  visual  perception. 

Conversely,  it  is  obvious  with  broadband  data  that  taking  a  Q-model  derived  from 
low-frequency  data  and  extending  it  in  frequency-independent  fashion  to  high  frequencies 
cannot  adequately  describe  the  most  basic  attributes  of  a  displacement  pulse.  Two  basic 
features  of  a  displacement  pulse  are  the  slope  of  its  rise  time  and  its  total  pulse  width. 
Figure  3.4  shows  the  P  wave  displacement  at  station  COL  from  a  deep  earthquake.  In 
order  to  match  the  slope  of  the  second  and  larger  of  the  subevents  using  an  attenuation 
operator  alone,  an  operator  with  a  constant  t"  of  1.0  s  would  be  necessary.  But  such  an 
operator  would  be  patently  incongruent  with  the  data.  The  operator  would  be  unable  to 
match  the  short  duration  of  the  subevent  and  it  would  lack  the  high  frequency  spectral 
content  that  is  shaping  the  waveform.  If,  instead,  a  frequency  independent  t'=  0.6  s  is 
used  (which  also  happens  to  be  the  operator  that  would  be  predicted  by  the  frequencv- 
independent  Q  model  of  SLb  Anderson  and  Hart,  1978] ) .  the  resulting  waveform  would 
have  the  same  pulse  width  as  the  data.  However,  it  could  neither  match  the  initial  slope 
nor  the  decay  history  of  the  actual  data.  These  remaining  features  can  only  be  fit  by 
incorporating  more  sophisticated  features  in  the  modeling  process.  As  described  later, 
these  will  be  frequency  dependence  in  attenuation  and  a  realistic  description  for  the  rupture 
process.  It  is  important  to  emphasize  that  without  broadband  data  it  would  be  difficult 
to  evaluate  the  fit  between  various  attenuation  operators  and  the  data.  The  intermediate 
and  high  frequencies  which  would  otherwise  unmistakably  distinguish  the  effect  of  various 
attenuation  operators  on  a  source  function  are  often  suppressed  upon  convolution  with  the 
response  of  a  long-period  seismograph. 

Finally,  a  distinct  advantage  of  time-domain  analysis  over  a  purely  spectral  approach 
can  be  seen  by  comparing  the  P  wave  displacement  pulse  with  its  amplitude  spectrum 
(Figure  3.5).  The  subevents  which  are  evident  in  the  time  domain  manifest  themselves 
only  as  a  pronounced  modulation  in  the  spectrum  between  approximately  0. 1-1.0  Hz  In 
spectral  form,  it  would  be  difficult  to  obtain  any  intuition  about  the  rupture  process.  Also, 
the  spectral  modulation  might  tend  to  bias  estimates  of  attenuation  based  on  the  spectral 
decay  unless  one  averages  the  spectra  of  many  events  with  different  rapture  histories  from 
the  same  region. 


234 


3.3  COMPUTATION  OF  BODY  WAVES 

At  teleseismic  distances,  the  cone  of  taKe-orf  angles  for  the  seismic  rays  describing 
the  earth’s  response  between  source  and  receiver  is  very  narrow,  allowing  source  effects  to 
be  decoupled  from  propagation  effects.  In  order  to  model  body  waves,  our  experimental 
procedure  is  to  find  those  source  and  propagation  operators  which,  when  convolved  with 
each  other,  match  the  observed  data  the  best.  In  this  section  we  first  describe  our  method 
for  computing  source  and  propagation  operators.  Then  we  describe  the  constraints  which 
minimize  the  tradeoff  between  between  these  operators.  It  turns  out  that  the  constraints 
are  strong  because  the  dispersive  property  of  intrinsic  attenuation,  which  dominates  the 
propagation  operator,  generally  contrasts  distinctively  with  the  energy  flux  predicted  by 
causal  rupture  models. 

Source  Operator 

A  simple  paran  eterization  of  the  source  spectrum  is  often  employed  by  studies  of 
attenuation  in  the  Cequency  domain  to  correct  for  source  effects.  It  consists  of  assuming 
that  the  amplitude  maintains  a  constant  level  that  is  proportional  to  seismic  moment 
from  low  frequencies  up  to  a  corner  frequency.  Beyond  the  corner  frequency,  amplitude 
decay  would  be  proportional  to  a  power  of  frequency,  usually  between  u~ 2  and  a;-3.  Even 
without  correcting  for  attenuation  this  simple  paradigm  is  obviously  a  poor  approximation 
to  an  actual  displacement  spectrum,  such  as  shown  in  Figure  3.5,  for  all  frequencies  but  those 
which  are  asymptotically  low  (/  <<0.1  Hz)  or  which  are  asymptotically  high  (/  >>1.0 
Hz).  For  intermediate  frequencies  the  spectra  may  have  a  transitional  falloff  and  may  be 
strongly  modulated.  The  behavior  of  the  transitional  region  can  depend  on  the  complexit> 
and  finiteness  of  the  source  (Boore  and  Joyner,  1978).  It  would  even  be  difficult  to  choose 
an  unequivocal  corner  frequency.  Boatwright  (19*4)  has  pointed  out  that,  in  a  multiple 
rupture,  the  corner  frequency  of  the  dominant  subevent  produces  the  strongest  change  in 
the  amplitude  spectrum.  For  example,  the  strongest  inflection  in  the  spectrum  of  Figure 
3.5  occurs  at  about  1  Hz.  The  inverse  of  this  frequency  corresponds  to  the  duration  of  the 
stronger  subevent  of  the  double  pulse  at  COL  (Figure  3.4)  rather  than  to  the  total  duration 
of  the  displacement  pulse. 

Our  experience  with  broadband  data  suggests  that  the  majority  of  moderate-sized 
deep  earthquakes  rupture  in  a  complex  fashion.  With  broadband  data,  systematic  vari¬ 
ations  in  waveform  about  the  focal  sphere  can  be  seen  and  related  to  source  complexity 
for  earthquakes  as  small  as  nib  5.5  (Choy  and  Boatwright.  1981).  In  order  to  model  the 
broadband  intormation  it  is  essential  that  a  suitably  realistic  model  of  rupture  be  used.  We 
follow  the  method  of  Choy  and  Boatwright  (1981)  to  generate  our  source  operators.  An 
important  aspect  of  the  procedure  is  that  constraints  on  the  rupture  process  are  obtained 
primarily  through  modeling  the  velocity  pulse  shapes.  Velocity,  being  directly  related  to 
energy  radiated  through  the  focal  sphere,  naturally  emphasizes  energy  near  and  above  the 
corner  frequency  of  an  event.  For  each  earthquake  all  the  velocity  pulse  shapes  are  exam- 
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ined  as  a  function  of  takeoff  angle  and  azimuth  about  the  fault  plane  in  order  to  identify 
features  which  are  coherent  or  which  vary  systematically  from  station  to  station.  Source 
operators  are  then  obtained  by  modeling  the  rupture  process  with  the  quasidynamic  mod¬ 
els  of  Boatwright  (1980).  These  models  describe  rupture  growth  as  self-similar  and  rupture 
healing  as  related  causally  to  the  stopping  of  rupture  growth.  At  asymptotically  high  fre¬ 
quencies,  the  spectral  falloff  of  these  models  varies  between  u~2  and  w~3,  depending  on 
the  healing  of  the  rupture.  Thus,  at  high  frequencies  these  models  conform  to  asymptotic 
corrections  used  in  many  spectra!  amplitude  approaches. 

Propagation  optrator 

Propagation  effects  include  geometrical  spreading,  attenuation  and  interactions  with 
discontinuities  in  the  earth.  We  first  discuss  the  attenuation  operator  which  is  the  domi¬ 
nant  propagation  effect  in  the  coherent  portion  of  the  waveforms.  Then  we  discuss  inter¬ 
actions  with  earth  structure,  which  generally  produce  minor  and  late-arriving  features  in 
the  waveforms. 


Attenuation  operators  can  be  computed  analytically  by  any  of  several  methods  of 
seismogram  synthesis,  such  as  the  full  wave,  reflectivity  and  WKBJ  methods  (e.g.,  Cormier 
and  Richards,  1976;  Kennett,  1975;  Chapman.  1978).  if  attenuation  is  both  dispersive 
and  frequency  dependent,  the  computation  becomes  expensive  and  time-consuming.  Our 
approach,  therefore,  is  to  first  compute  dispersive  operators  conforming  to  the  Kramers- 
Kronig  relations.  The  operators  that  fit  our  data  best  are  then  related  to  Q  models  that 
generate  the  same  time-functions  for  the  same  propagation  paths  in  the  earth. 

Assuming  asymptotic  ray  theory,  linear  visco-elasticity,  and  no  scattering  losses,  the 
complex  spectrum  of  the  Creen  s  function  for  a  body  wave  propagating  from  a  reference 
point  Xq  on  the  rupture  surface  to  a  receiver  at  x  is  given  by 


g(x,u)  =  G0(x,x0)A(u).  (3.1) 

The  junction  G0  is  a  product  of  a  function  of  tuc  densities  and  wave  velocities  at  Xo  arid 
x.  a  radiation  pattern  appropriate  for  the  wave  type  (P  or  S)  and  the  take-off  angles  for 
the  ray  from  x0  to  x.  and  the  geometric  spreading  from  x0  to  x.  A(u)  is  defined  b\ 

A(w)  =  e,w*M  (3.2) 


and  includes  the  attenuation  and  phase  shifts  due  to  propagation  and  dispersion.  I  hi 
complex  travel  time  T (w*)  in (3.2)  s  simply  the  integral  of  the  complex  slowness  along  the 
ray  path: 


(3.3) 


The  complex,  dispersive  velocity  <:(*•)  in (3.3)  is  given  by  c(w)  =  \  where  ,\)i_ 

\  P 

is  the  complex  elastic  modulus  determined  by  the  parameters  of  a  linear,  visco-elasi 
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rheology.  By  invoking  causality,  a  relation  between  Re  f(u)  and  Im  f(u)  can  be  found, 
which  is  equivalent  to  the  Kramers-Kronig  relations  for  the  complex  index  of  refraction  of 
electromagnetic  waves  or.  to  those  for  the  complex  velocities  of  seismic  waves  (Futterman, 
1962).  For  linear  visco-elasticity,  the  fastest  arriving  frequency  component  will  be  the 
infinite  frequency  (Gurtin  and  Herrera,  1965).  Thus  causality  can  be  satisfied  by  requiring 


that  the  time  domain  operator  /l(t)  =  0,  until  t  =  Re  f( oo)  =  J 
requirement  can  be  shown  to  lead  to  the  relation 

[Re  T{u)  -  Re  f{ oo)]  =  H\Im  f(w)), 
where  H  is  the  Hilbert  transform. 
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frequency  in  the  band  of  interest.)  Using  (3.4) and  the 
definition  of  t",  the  complex  spectrum  of  the  attenuation  operator(3.2):an  be  rewritten  as 
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For  a  generally  frequency  dependent  t\  the  shape  in  the  time  domain  of  .4(f)  can 
be  found  by  calculating  the  Hilbert  transform  of  f’(/)  and  inverse  Fourier  transforming 
(3.6).  T(x/  need  not  be  known  and  can  be  incorporated  into  a  reference  time  needed  to 
bring  the  apparent  start  of  the  pulse  into  the  finite  time  window  determined  bv  the  folding 
frequency  of  a  fast  Fourier  transform.  The  frequency  dependence  of  t"  in  (3.6),  however, 
cannot  be  completeK  arbitrary.  Some  restrictions  on  the  allowable  frequency  dependence 

of  t  (/)  are  imposed  by  linear  visco-elasticity.  In  particular,  since  t'  -  J  — — ds,  the 

frequency  dependence  of  f‘  must  be  consistent  with  Q  =  C  fn ,  where  n  <  1  (Minster  and 
Anderson,  1981).  Also  the  width  and  intensity  of  an  absorption  band  at  any  given  depth 

must  give  a  positive,  non-zero  relaxed  modulus  at  the  zeroth  frequency  limit  (Minster 
1978). 

For  linear  elasticity  and  causal  propagation,  the  form  of  the  attenuation  operator  will 
be  a  minimum-delay  pulse  that  arrives  after  the  reference  arrival  time.  Taking  the  attenua 
tion  operator  to  be  minimum  phase  implies  that  stochastic  processes  such  as  scattering  and 
multipathing  are  negligible.  This  is  appropriate  and  even  desirable  as  we  are  only  model¬ 
ing  the  features  of  waveforms  that  are  coherent  over  the  focal  sphere.  To  insure  that  ue 
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Figure  3.6.  [Tup]  The  broadband  velocity  records  from  stations  A1  (solid  line)  and  ('! 
(dashed  line)  of  the  GRF  array.  The  stations  are  about  85  km  apart.  The  coherent  portion 
of  the  waveforms  are  easilv  identified  as  the  first  two  peaks  and  the  subsequent  trougn. 
{ Bottom )  The  velocity  waveforms  from  northern  European  stations:  the  A1  station  of 
the  GRF  array  (solid  line).  KEY  (short  dashes),  and  BER  |long  dashes).  The  coherent 
component  of  the  waveforms  is  also  easily  identifiable  on  this  teleseismic  scale. 
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are  not  overinterpreting  details  of  the  waveforms  with  deterministic  models  of  rupture  and 
propagation,  we  would  like  to  obtain  a  measure  of  the  stability  and  reliability  of  broadband 
pulses.  This  can  be  done  by  following  a  suite  of  waveforms  across  the  broadband  stations 
of  the  large  aperture  GRF  array.  The  top  part  of  Figure  3.6  shows  the  typical  variation  of 
velocity  pulse  shapes  for  a  deep  earthquake  across  the  array.  Any  differences  among  wave¬ 
forms  would  be  expected  to  show  up  most  glaringly  in  velocity,  which  emphasizes  rapid 
changes  in  displacement.  Yet  the  dominant  features  of  any  velocity  waveform  are  easily 
found  in  the  other  traces.  In  the  GRF  waveforms,  these  features  are  the  initial  double 
peak  and  subsequent  broad  trough  in  the  velocity  pulses.  The  duration  of  the  coherent 
portion  of  the  velocity  waveforms  (about  4  seconds  from  the  onset  of  P)  corresponds  *o 
the  duration  of  the  dynamic  rupture  process.  Outside  this  coherent  window  the  codas  of 
all  the  traces  fall  rapidly  out  of  phase.  The  minor  variations  among  waveforms  within  the 
time  window  of  coherence  arise  primarily  from  relative  enrichment  or  depletion  of  high 
frequency  energy  which  serves  either  to  enhance  or  to  mitigate,  respectively,  the  sharpness 
and  amplitude  of  the  peaks  and  troughs.  The  overall  stability  of  the  velocity  pulses  persists 
on  a  teleseismic  scale.  The  bottom  part  of  Figure  3.6,  for  instance,  shows  the  pulse  shapes 
for  stations  located  in  Europe.  Differences  among  the  velocity  waveforms  are  confined  to 
bursts  of  high  frequency  energy  which  are  easily  distinguishable  from  the  coherent  part  of 
the  waveform  (e.g,  compare  the  synthetics  for  KEV  with  GRA1  in  Figure  3.2).  \lthough 
differential  amplification  as  well  as  scattering  may  arise  for  different  propagation  paths  as 
frequency  increases  above  about  1  Hz,  it  appears  that  most  of  the  energy  up  to  about  5  Hz. 
which  is  necessary  to  explain  the  complexity  of  the  waveforms,  will  still  arrive  coherently 
within  a  time  window  corresponding  to  the  duration  of  the  rupture.  In  an  examination 
of  stability  of  body  waves  across  the  GRF  array,  Choy  (1981)  observed  that  coherence  in 
phase  persisted  to  higher  frequencies  than  coherence  in  amplitude.  This  observation  is 
consistent  with  the  theory  of  Chernov  scattering.  Fluctuations  in  the  logarithm  of  the 
amplitude  spectrum  are  on  the  same  order  as  fluctuations  in  the  phase  spectrum.  Thus, 
fluctuations  in  the  absolute  amplitude  will  be  larger  than  those  in  phase. 

As  this  study  is  restricted  to  examining  body  waves  from  deep  earthquakes,  crustai 
correction*  of  direct  P  and  SH  body  waves  need  only  be  considered  for  the  receiver  end 
of  the  propagation  path.  The  crustal  response  near  the  receiver  could  be  one  source  of 
incoherence  among  stations,  as  it  is  not  likely  that  different  stations  would  have  identical 
crustal  responses.  However,  incorporating  the  crustal  response  was  generally  found  to  havp 
little  effect  on  body  waves  during  the  interval  of  coherent  rupture.  This  is  not  surprising 
:  :  body  waves  from  deep  earthquakes  minimizes  the  effect  of  crustal  response 

At  teleseismic  distances  the  angles  of  incidence  of  P  and  SH  waves  are  very  steep.  Becau**’ 
we  consider  only  the  vertical  component  of  P  and  the  trar averse  component  of  SH.  the 
crustal  reverberations  predicted  by  standard  earth  models  are  both  negligible  in  amplitudt 
and  arrive  much  later  in  the  wavetrain  than  the  duration  of  the  rupture.  It  would  require 
local  velocity  inversions  of  rather  large  contrast  (  25%)  to  produce  significant  (>  )0'i 
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amplitudes.  Such  structures  would  also  have  to  be  shallow  in  order  to  produce  arrivals 
in  the  coda  that  immediately  follows  the  P  waves.  The  variability  of  the  P  wave  coda 
could  be  explained  in  part  by  ad  hoc  low-velocity  layers  at  the  surface  or  within  the  crust. 
However,  the  response  of  these  ad  hoc  models  would  still  be  too  small  to  alter  the  coherent 
part  of  the  waveforms  significantly.  Furthermore,  a  portion  of  the  coda  might  not  even  be 
related  to  near-receiver  structure.  Owens  et  al.  (1983)  have  inferred  the  structure  beneath 
stations  of  the  RSTN  network.  Many  of  these  stations  have  low-velocity  sedimentary  layers 
at  the  surface.  Incorporating  the  crustal  responses  into  the  synthetics  did  not  improve  our 
fits.  In  fact,  the  behavior  of  the  theoretical  response  was  occasionally  opposite  to  that 
observed  in  the  data,  suggesting  that  some  other  propagation  effect  can  affect  the  coda. 
Fortunately,  however,  with  the  redundancy  of  data  in  North  America  the  coherent  part 
the  waveform  can  be  identified  easily  and  an  occasional  mismatch  does  not  affect  the 
overall  inversion  for  the  source.  The  velocity  pulses  consistently  exhibit  the  double  peak 
followed  by  a  trough.  More  generally,  the  predominant  coherence  of  displacement  and 
velocity  pulses  over  teleseismic  distances  indicates  that  anomalous  velocity  conditions  do 
not  exist  at  most  of  the  stations  we  used.  As  it  is  not  the  purpose  of  this  paper  to  detail 
crustal  structure,  the  synthetics  are  presented  without  convolution  with  a  crustal  response. 

3.4  MODELING  CONSTRAINTS 

In  our  analysis,  we  derive  the  best  source  and  attenuation  operators  that,  when  con¬ 
volved  with  each  other,  provide  the  optimum  fit  to  both  displacement  and  velocity  data  for 
a  suite  of  pulse  shapes  about  the  focal  sphere.  In  order  to  estimate  the  error  bounds  due 
to  the  nonuniquenes  of  this  convolution,  it  is  necessary  to  examine  the  tradeoff  between 
source  and  attenuation  operators. 

Midband  and  high  frequencies 

For  midbanc  and  high  frequencies  (/  >  0.1  Hz),  the  tradeoff  is  not  large  because 
conditions  imposed  by  intrinsic  attenuation  and  causal  rupture  predict  waveforms  that 
can  behave  in  drastically  dissimilar  fashion.  By  explicitly  considering  broadband  data, 
these  differences  are  distinguishable. 

Intrinsic  attenuation  operators  conforming  to  the  Kramers-Kronig  relations  are  min¬ 
imum  phase  waveforms.  That  is.  they  have  the  minimum  possible  phase  shift  for  their 
amplitude  spectrum  (Robinson.  1967).  In  the  time  domain,  this  describes  an  impulse 
response  in  which  the  energv  buildup  occurs  as  close  as  possible  at  the  beginning  of  the 
waveform.  Examples  of  such  operators  are  shown  in  the  first  two  columns  of  Figure  3.7.  One 
of  the  operators  employs  a  frequency  independent  t '  of  0,6  s.  The  other  operator  uses  laf 
frequency  dependent  t ,  model  that  is  derived  later  in  this  paper.  The  top  trace  shows  the 
displacement  for  each  operator.  The  second  trace  shows  the  corresponding  velocity  trace 
Each  positive  pulse  in  displacement  becomes  a  positive  and  negative  pulse  in  velocity.  The 
last  trace  shows  the  square  of  the  velor  i t y  traces.  As  energy  flux  is  directly’  related  to  veloc - 
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Figure  3.7.  The  displacement.  velocity  and  velocity-squared  records  of  typical  atten¬ 
uation  and  source  operators.  The  first  column  shows  an  attenuation  operator  using  » 
frequency-independent  /'  =  Ofi  s,  Th*1  center  column  uses  the  frequency-dependent  f'!/ 
typical  of  a  high-Q  path  (lower  solid  ime  in  F.gure  *).  The  third  column  shows  the  behavin’ 
of  a  typical  source  operator.  Because  operators  describing  intrinsic  attenuation  are  mini¬ 
mum  phase  operators,  the  amplitude  of  the  first  pulse  of  their  v2  records  always  exceeds 
the  amplitude  of  the  second  pulse.  The  behavior  of  source  operators  is  often  drastically 
dissimilar.  In  this  example,  the  amplitude  of  the  second  pulse  greatly  exceeds  that  of  the 
first  pulse. 
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ity  squared,  the  latter  waveforms  demonstrate  the  two  major  attributes  of  minimum  phase 
operators.  First,  the  preponderance  of  energy  arrives  in  the  first  pulse  of  velocity-squared. 
The  amplitude  of  the  first  velocity-squared  pulse  is  much  greater  than  the  amplitude  of 
the  second  pulse.  Secondly,  the  pulse  width  of  the  first  pulse  in  velocity-squared  is  always 
less  than  the  width  of  the  second  pulse. 

Depending  on  the  rupture  geometry  it  is  possible  for  the  energy  flux  of  a  causal 
rupture  process  to  vary  considerably  between  widely  separated  points  on  the  focal  sphere. 
The  greatest  differences  in  an  asymmetric  planar  rupture,  for  instance,  would  be  seen 
among  pulse  shapes  that  are  radiated  in  the  direction  of  and  behind  the  direction  of 
lip.  However,  by  analysing  only  events  which  meet  rather  simole  criteria,  some  strong 
generalizations  ran  be  made  about  the  behavior  of  energy  flux.  Bv  using  only  events  with 
dip-slip  focal  mechanisms,  the  takeoff  angles  for  rays  recorded  at  teleseismic  distances  are 
then  all  relatively  steep  with  respect  to  either  fault  plane.  Furthermore,  the  teleseismic 
rays  subtend  a  small  cone  in  which  the  variation  of  energy  flux,  although  observable,  does 
not  change  rapidly  with  takeoff  angle.  A  majority  of  earthquakes  then  are  found  to  rupture 
such  that  a  substantial  amount  of  energy  is  radiated  as  the  rupture  decelerates  and  heals. 
An  example  is  shown  in  the  last  column  of  Figure3.7.The  rupture  model  used  to  compute 
the  synthetic  is  the  one  that  is  derived  for  the  dominant  subevent  of  a  deep  earthquake 
that  is  examined  later  in  this  paper.  The  displacement  pulse  has  a  rise  time  and  decay 
time  which  in  velocity  becomes  two  pulses.  The  first  pulse,  called  the  rupture  phase, 
describes  rupture  growth.  The  second  pulse,  called  the  healing  phase,  describes  healing  of 
the  rupture  front.  The  velocity-squared  trace  is  directly  proportional  to  radiated  energv 
flux.  In  contradistinction  to  the  attenuation  operators,  this  source  operator  has  a  velocity- 
squared  function  in  which  it  is  the  amplitude  of  the  second  pulse  which  greatly  exceeds 
that  of  the  first  pulse.  The  pulse  widths  of  the  rupture  and  healing  phases  are  also  of  the 
same  order.  The  gross  dissimilarity  in  how  energy  is  radiated  over  time  is  what  permits 
us  to  reduce  the  tradeoff  between  source  and  attenuation.  (However,  we  emphasize  that 
an  earthquake  with  a  dip-slip  mechanism  does  not  guarantee  that  the  healing  phase  i> 
dominant.  It  is  possible  for  some  rupture  processes  to  mimic  the  behavior  of  a  minimum 
phace  operator  b\  radiating  more  energy  during  the  rupture  phase  than  during  the  healing 
phase.  Those  events  in  which  source  and  attenuation  effects  are  not  easily  distinguished 
are  obviously  not  useful  for  deriving  attenuation  operators.) 

A  symptotically  low  and  high  frequencies 

Synthetic  seismograms  for  broadband  P  waves  in  which  the  dominant  corner  frequenc\ 
is  greater  than  0.1  Hz  are  not  gpnerally  sensitive  to  low  frequencies  (/  <  0.1  Hz).  Our 
constraints  for  low  frequencies  are  based  on  model  ABM  (Anderson  and  Given.  1982)  which 
was  derived  from  da*  a  that  explicitly  considered  normal  mode  and  long-period  body  wa\»> 
data.  The  depth-corrected  value  of  t\  at  0  01  Hz  is  about  0.8  s  for  all  the  teleseismic 
distances  under  consideration.  For  /  =  0.1  Hz.  the  tg  of  model  ABM  is  about  0  6  s 


However,  neither  our  synthetics  nor  the  ABM  model  preclude  values  up  to  0.8  s.  Our 
synthetics  slightly  favor  a  value  of  0.7  s. 

The  spectral  amplitudes  of  asymptotically  high  frequencies  (/  >>  1.0  Hz)  are  two  or 
more  orders  of  magnitude  smaller  than  the  amplitude  of  the  dominant  corner  frequency.' 
Consequently,  constraints  in  the  high  frequency  range  are  easier  to  obtain  by  a  comparison 
of  synthetic  and  observed  spectra  rather  than  in  the  time  domain.  Our  synthetic  amplitude 
spectra  are  usually  in  excellent  agreement  with  data  down  to  5.0  Hz  (e.g.,  Figure  3.5).  As 
our  source  models  have  an  asymptotic  falloff  that  conforms  to  the  asymptotic  corrections 
used  in  many  purely  spectral  approaches,  we  expect  our  error  bounds  for  t‘(f)  at  high 
frequencies  to  be  as  good  as  that  which  would  be  obtained  by  spectral  approaches. 

3.5  P  WAVE  ANALYSIS 

Waveiorms  for  analysis  are  from  a  deep  focus  earthquake  that  occurred  on  20  April 
1985  at  a  depth  of  582  km.  For  the  focal  mechanism  we  use  the  solution  from  first 
motions  of  P  waves  which  is  in  agreement  with  the  body-wave  moment-tensor  solution 
given  by  the  NEIS  (National  Earthquake  Information  Service).  We  have  also  verified  the 
solution  using  SH  and  sSH  polarities  and  relative  amplitudes.  The  other  source  parameters 
derived  for  this  earthquake  by  modeling  the  broadband  P  waves  are  given  in  Table  3.1.  The 
earthquake  consists  of  two  rupture  events,  separated  by  1.0  s.  The  relative  locations  of 
their  points  of  nucleation  are  not  sufficiently  well-constrained  that  a  preferred  fault  plane 
can  be  determined.  The  shape  of  the  source  functions  is  not  affected,  however,  by  the 
choice  of  a  fault  plane  because  the  individual  events  are  approximately  circular  ruptures. 
As  the  areas  of  the  rupture  events  probably  overlap,  it  can  be  inferred  that  the  initial 
nucleation  was  momentarily  stopped  by  a  fracture  barrier.  The  failure  of  this  barrier  then 
resulted  in  the  second  rupture  event. 

In  computing  our  synthetics  the  same  source  model  was  used  for  every  station.  The 
<’(/)  operator  was  allowed  to  vary  slightly  in  order  to  accommodate  the  reality  of  reg'ona! 
variations  in  attenuation.  The  synthetic  seismograms  obtained  bv  modeling  the  displace 
ment  and  velocity  records  are  shown  in  Figure  3.2.  Figure  3.5  shows  a  typical  fit  between  our 
synthetic  and  the  observed  data  in  the  frequency  domain.  As  expected,  a  good  fit  in  the 
time  domain  tranforms  to  a  good  fit  in  the  frequency  domain.  It  must  be  remembered, 
however,  that  in  order  to  arrive  at  the  good  spectral  fit  across  a  continuous  frequency  rang*' 
from  0.02  to  5  Hz  we  first  had  to  model  in  the  time  domain.  The  amplitude  spectrum  b\ 
itself,  devoid  of  ari\  phase  information,  is  normally  insullicient  to  give  any  intuition  abom 
the  dynamic  behavior  of  the  rupture. 

In  a  radially  homogeneous  earth  and  our  frequency  band  of  interest.  t\(f)  is  a  weak 
function  of  distance  for  teleseismic  body  waves.  Regional  variations  must  exist.  however 
as  we  still  required  two  types  of  operators  to  synthesize  the  waveforms  (Figure  3.8).  These 
operators  are  appropriate  for  rays  that  propagate  through  apparent^  lower  and  mgrn  • 
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Parameter 

Value 

Depth 

582  km 

Strike 

236 

Dip 

12“ 

Rake 

-90 

Moment 

3  8  x  10”  dyn  cm 

Rupture  radius  1 

2  3  km 

Rupture  radius  2 

3.5  km 

Table  3.1  Source  Parameters  of  the  Sea  of  Okhotsk  Earthquake  of  April  20,  1984  (Origin  Time 
0631:10:6;  50.12°N,  148.73^) 

The  earthquake  consisted  of  two  subevents  that  had  approximately  circular  geometries.  The  second 
subevent  occurred  1.0  s  after  the  initial  event.  The  relative  location  of  the  second  event  is  not 
well  constrained,  but  its  rupture  area  probably  overlaps  the  rupture  area  of  the  first  subevent. 
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Figure  3.8.  The  frequency  dependence  of  t'a  for  a  source  at  depth  600  km.  Two  types 
^ctU )  curves  (solid  lines)  were  needed  to  fit  the  observed  waveforms.  The  two  curve6 
converge  at  about  0.3  Hz.  The  long-dashed  line  between  0.01-0.1  Hz  indicates  assumed 
values,  as  our  synthetics  are  not  sensitive  to  this  frequency  range.  At  0.01  Hz,  we  assume 
the  value,  after  depth-correction,  of  tma  given  by  model  ABM  (Anderson  and  Given,  1982). 
The  short-dashed  lines  bracket  the  uncertainty  in  our  attenuation  operators. 
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attenuating  zones  in  the  earth.  Also  in  Figure  3  8  are  dashed  lines  bracketing  thp  uncertainty 
in  our  t'a(f)  model.  The  long-dashed  part  of  the  tma(f)  curve  indicates  the  frequency  range 
for  which  the  data  are  not  sensitive.  Accordingly,  we  take  the  value  of  t'a  at  .01  Hz  from 
model  ABM.  At  0.1  Hz,  our  preferred  value  is  0.7  s.  The  convergence  of  our  low-  and  high- 
Q  t'a  operators  indicates  that  we  can  resolve  no  strong  regional  differences  for  frequencies 
less  than  abcut  0.3  Hz.  This  does  not  mean  that  regional  differences  do  not  persist  to 
lower  frequencies.  It  merely  indicates  that  our  data  set  cannot  resolve  such  differences 
within  the  indicated  error  bounds.  Der  and  Lees  (1985),  for  example,  analysed  different 
data  that  suggest  that  regional  differences  in  attenuation  persist  down  to  0.01  Hz. 

Waveforms  at  stations  BER,  RSSD,  CHTO  and  COL  could  be  synthesized  by  using 
*’(/)  typical  of  the  high-Q  path.  All  other  stations  except  HON  have  waveforms  that  could 
be  synthesized  by  the  ta(f)  for  the  low-Q  path.  The  HON  waveform  required  a  nearly 
constant  ta  =  0.6s  for  all  frequencies  greater  than  0.1  Hz.  This  anomalous  curve  is  not 
included  in  Figure 3.8.  The  shape  of  these  t'a  curves  are  similar  in  principle  to  that  of  Der  et 
al.  (1982,1985),  but  our  results  do  not  preclude  downward  concavity  nor  some  complexity 
1  ucb  as  an  inflection)  in  the  (f)  curve  in  the  midband  range. 

3.6  S  WAVE  ANALYSIS 

In  order  to  obtain  the  attenuation  operator  for  S  waves,  we  find  the  minimum  phase 
operator  that,  when  convolved  with  the  source  model  derived  from  the  P  wave  analysis,  best 
fits  the  displacement  and  velocity  pulse  shapes.  For  a  finite  source,  the  major  theoretical 
differences  between  P  and  S  wave  spectra  involve  a  corner  frequency  shift  and  the  rate 
of  spectral  falloff  beyond  the  corner  frequency.  For  the  earthquake  we  are  modeling,  the 
corner  frequency  of  the  dominant  subevent  is  shifted  by  a  factor  of  0.7  and  the  spectral 
falloff  is  approximately  u; 1  5.  A  typical  fit  between  our  synthetic  and  observed  S-wave 
spectra  is  shown  in  Figure  3.9.  Noise,  primarily  signal-generated,  usually  exceeds  signal 
for  frequencies  above  about  2.0  Hz.  In  the  time  domain,  the  observed  and  svnthetic 
seismograms  of  displacement,  velocity  and  v elrw'itv-sauared  [v~)  are  shown  in  Figures  3.10 
and  3.11  for  S  and  ScS  body  waves,  respectively. 

In  an  earth  where  attenuation  is  depth  and  frequency  dependent,  t'p(f)  may  van 
noticeably  with  distance.  A  typical  tg(f)  curve,  appropriate  for  a  distance  of  50°.  is  shown 
in  Figure  3.12.  This  curve  was  derived  from  analysis  of  the  S  wave  recorded  at  R.CNT.  We 
now  describe  the  constraints  provided  hv  RSNT.  The  derivation  of  t  ’,( f)  for  other  distances 
follows  a  similar  procedure.  Our  broadband  S  waves  (Figure  3.10)  are  not  sensitive  to  very 
low  frequencies  (  /  «  •  0.1  Hz).  The  lack  of  constraint  at  low  frequencies  is  'muieat«-q 
by  the  long-dashed  portion  of  the  t  j|/)  curve.  At  0.01  Hz.  we  assume  the  value  of  the 
ABM  model  (Anderson  and  Given,  1982)  which,  after  correction  for  a  source  at  depth.  is 
t$  —  3.7  s.  However,  in  this  low  frequency  range,  the  synthetic  waveforms  are  not  sensitive 
to  Values  of  tp  as  high  as  4.0  s.  which  is  a  value  suggested  bv  using,  after  correction  for 
depth,  the  QSrS  of  Sipkin  and  Jordan  (1979).  Constraints  near  0.1  Hz  up  to  about  2  0  Hz 
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Figure  3.11.  Broadband  ScS  waves  of  displacement,  velocity  and  \eloc  iiv-squared  (solid 
li.ies).  synthetic  records  are  plotted  as  dashed  lines. 
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of  5FoToHd2;,„  J1' rseqrncy,  dr;?“" of  for  a  wurce  at  depth  g°°  km  -d  > 

,,,.  svnt h  ’  l""*'d“h"d  lm*  bpl'wn  *01-0.1  Hz  indicates  assumed  values,  as 

our  s>  nthelics  are  not  sens, live  to  /  «0.1  Hz.  At  0.01  Hz  and  for  a  depth  of  600  km  we 

K  “  """  ^  ^  ft LM  m°del  Whkh  ‘i  °f  ••><».*  3-7  ..  The  shou-dlhld 
lines  bracket  the  uncertainty  of  this  attenuation  operator. 
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are  much  stronger.  In  Figure  3. lO/.he  S  wave  displacement  at  RSNT  is  characterized  by  a 
steep  rise  in  displacement  followed  by  a  very  slowly  decaying  tail.  Frequencies  near  0.1  Hz 
control  the  rate  of  falloff  for  most  of  the  long  tail..  Using  a  preferred  value  of  3.5  s  at  0.1  Hz, 
the  tail  of  the  synthetic  slightly  underfits  the  tail  of  the  data.  The  underfit  compensates 
for  what,  in  our  estimation,  is  the  level  of  coda  noise.  The  upper  bound  at  0.1  Hz  is  a 
value  which  generates  a  synthetic  tail  that  overfits  the  observed  decay  in  displacement. 
The  lower  bound,  about  3.2  s,  is  a  value  where  the  tail  of  the  synthetic  displacement  is 
obviously  too  sharp  to  be  compensated  by  the  addition  of  noise  on  the  same  level  as  the 
coda.  By  emphasizing  radiated  energy,  the  v 2  record  provides  constraints  for  the  higher 
frequencies.  In  v2,  the  rise  time  in  displacement  becomes  a  pulse  in  which  is  contained 
almost  all  the  energy  of  the  body  wave.  The  remainder  of  the  energy  is  concentrated 
in  the  second  r2  pulse  of  similarly  short  duration  but  markedly  smaller  amplitude.  The 
slowly  decaying  tail  of  the  S  wave  displacement  apparently  contributes  virtually  no  energy 
to  the  v2  record.  The  duration  in  which  the  preponderance  of  energy  is  concentrated 
provides  strong  constraints  on  frequencies  at  and  above  0.3  Hz.  The  lower  bound  of  the 
uncertainties  in  t'p(f)  for  this  frequency  range  is  where  the  velocity  seismograms  begin 
to  exhibit  unacceptable  high-frequency  features.  An  upper  bound  for  t’p  is  chosen  at  the 
value  where  the  resulting  synthetics  are  so  depleted  of  high  frequencies  that  the  durations 
of  the  observed  v2  pulses  cannot  be  matched. 

3.|  A  MODEL  OF  Qj{r,f) 

The  best  fitting  t j(/)  measurements  for  S  and  ScS  phases  at  varying  distance  ranges 
were  used  to  determine  a  frequency  and  depth  dependent  model  of  Qp{r,  /),  where  r  is 
radius.  Broadband  S  waves  were  primarily  only  available  from  the  subset  of  RSTN  stations 
Thus,  it  was  possible  to  apply  our  procedure  for  determining  t $  only  to  these  stations.  As 
previously  described  in  the  P-wave  analysis,  some  regional  dependence  of  attenuation  for 
frequencies  near  and  above  1.0  Hz  can  be  seen  in  the  comparisons  of  broadband  P-waves. 
At  these  frequencies,  the  attenuation  of  S  waves  is  so  strong  that  regional  differences  in 
tj  would  not  be  discernible.  It  may  be  assumed  that  the  S  wave  paths  available  from  the 
RSTN  stations  traversed  a  region  of  nearly  uniform  attenuation.  No  attempt  was  made  to 
invert  for  regional  differences  among  the  S  wave  paths  to  the  RSTN  stations. 

Within  the  bounds  of  the  source-attenuation  tradeoff,  no  evidence  for  bulk  attenuation 
was  found  for  any  pair  of  P  and  S  propagation  paths.  For  the  frequencies  for  which  we 
had  the  greatest  constraints  (/~0.1  Hz),  our  t'  operators  favor  4.ot'a.  Synthetics 

computed  using  smaller  ratios  of  jr  fit  the  observed  data  worse  than  synthetics  using 
higher  ratios.  This  would  be  consistent  with  the  assumption  that  attenuation  occurs  !r. 
pure  shear.  Along  paths  for  which  the  ratio  j  is  constant,  the  assumption  of  pure  shear 
in  attenuation  leads  to  the  relation 

(3.7) 
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In  the  most  highly  attenuating  region  of  our  Qp{r,f)  model  of  the  upper  mantle,  the 
isotropic  PREM  model  (Dziewonski  and  Anderson,  1981)  has  (  ))  =  1.8l  at  1  Hz.  Substi¬ 
tuting  this  in  (7)  gives  tp=\.bt'a. 

There  are  many  possible  parameterizations  of  Qp{r,f )  that  are  consistent  with  the 
observed  tp(f)  functions.  A  general  requirement  of  the  data  is  that  the  t'g(f)  decreases 
gradually  with  increasing  frequency  in  the  0.1  to  1  Hz  band.  The  rate  of  this  decrease  is 
slower  than  that  predicted  by  a  simple  absorption  band  cut-off,  in  which  the  cut-off  fre¬ 
quency  is  constant  with  depth,  and  above  which  Q  increases  as  the  first  power  of  frequency. 
To  fit  the  observed  tp(f)  curves  within  the  estimated  error  bounds,  either  this  cut-off  fre¬ 
quency  must  vary  with  depth,  or  the  power  n  in  a  Q  frequency  dependence  Q  =  Cfn 
varies  with  depth,  or  both.  These  are,  however,  the  general  and  expected  features  of  all 
models  that  seek  to  tie  a  Qp(r,f)  model  to  a  rate-activated  process  that  depends  on  the 
temperature  and  pressure  profiles  of  the  Earth’s  mantle.  Such  models  have  been  proposed 
by  Lundquist  and  Cormier  (1980)  and  Anderson  and  Given  (1982),  with  parameters  ad¬ 
justed  to  fit  seismic  data  over  the  frequency  band  from  Chandler  wobble  to  10  Hz  body 
waves.  In  these  models  the  frequency  and  depth  dependence  of  Qp  is  related  to  the  (P,T) 
profiles  of  the  mantle  through  an  assumed  dependence  of  the  longest  relaxation  time  T\ 
(or  high  frequency  cut-off  fc  =  ^-)  of  the  form 


=  t0  exp 


( E '  +  PV) 
RT 


where  E  and  V  are  an  activation  energy  and  volume  change,  respectively,  associated  with 
the  movement  of  Avogadro's  number  of  defects  over  their  respective  potential  barriers.  The 
uiji.her  time  constant  ro  ior  low  frequency  cut-off)  of  the  relax-ation  spectrum  of  the  mantle 
can  be  constrained  to  some  extent  by  the  gravest  modes  of  oscillation  of  the  Earth  and  by 
the  period  and  attenuation  of  the  Chandler  wobble.  Minster  and  Anderson  (1981)  suggest 
that  a  five  decade  band  of  frequencies  is  a  typical  width  to  expect  for  the  relaxation 

spectrum  of  the  mantle,  and  the  model  of  Anderson  and  Given  (1982)  fixed  the  ratio  P 
to  105.  ' 1 

A  model,  which  is  consistent  with  the  measured  t$(f)  curves  in  the  present  study, 
can  be  found  by  perturbing  the  parameters  of  either  the  double  absorption  band  mode! 
of  Lundquist  and  Cormier  (1980)  or  thp  parameters  of  the  model  by  Anderson  and  Given 
(1982)  Table  3.2  dsts  one  of  the  possible  adjustments  in  the  Anderson  and  Givpn  model 
that  will  produce  good  fits  to  the  /;,(/)  curves.  This  model,  although  giving  a  relatively 
complex  depth  and  frequency  dependence,  is  specified  by  a  small  number  of  parameters 
three  layers,  a  value  of  n  in  each  layer;  a  fixed  ^ ;  ratio,  a  power  law  for  the  frequence 
dependence  of  Qq  between  r2  and  tx  \  and  a  minimum  Qj  =  80  and  Qx  =  oo.  A  search 
for  fits  to  observed  t\  curves  consisted  in  simplv  tabulating  S.  ScS,  and  sS  travel  times 
through  the  three  layers  and  testing  various  assumptions  for  a  power  law  n  in  a  relation 
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Qp  -  Cfn  .  The  value  of  n  was  allowed  to  vary  in  each  layer  and  allowed  to  take  on 
different  values  in  different  frequency  bands  in  the  range  |nj  <  1.  The  starting  model  of 
Qp(r,  f)  was  taken  to  be  that  of  Anderson  and  Given.  Since  this  model  was  designed  to  fit 
a  broad  band  of  low  frequency  data,  including  free  oscillations  and  Chandler  wobble,  an 
attempt  was  made  to  keep  the  perturbations  to  this  model  small  and  confined  to  the  mid 
and  higher  frequency  band  of  body  waves  constrained  by  our  data.  Even  with  the  small 
number  of  free  parameters,  however,  many  possible  combinations  can  be  found  that  will  fit 
the  observed  tp{f)  curves.  In  particular,  the  power  law  between  r2  and  T\  is  rather  weakly 
constrained  as  well  as  the  frequency  of  transition  between  a  specified  power  law  and  a 
/  1  decay  of  the  attenuation.  The  frequency  dependence  of  attenuation  in  the  uppermost 
400  km  of  the  mantle  was  constrained  by  measurements  of  tm0  from  sS  waveforms.  It 
was  found  that  crustal  reverberations  at  the  bounce  point  of  sS  can  strongly  affect  the 
long  period  tail  of  sS.  introducing  some  trade-off  between  crustal  structure  at  the  surface 
bounce  point  and  the  frequency  dependence  of  attenuation  in  the  band  up  to  1  Hz  in  the 
asthenosphere.  In  the  uppermost  400  km  of  the  mantle,  the  model  given  in  Table  3.2  was 
chosen  to  be  consistent  with  the  Q  model  for  the  same  region  given  by  Anderson  and  Given 
and  the  double  absorption  band  model  of  Lundquist  and  Cormier.  In  all  of  these  models, 
the  relaxation  spectrum  in  the  uppermost  200  -  400  km  of  the  mantle  spans  most  of  the 
body-wave  band  from  0.01  Hz  up  to  several  Hz.  The  frequency  band  of  attenuation  in  the 
uppermost  mantle  may  be  related  to  the  rapid  increase  in  temperature  in  the  upper  100 
km  of  mantle,  which  may  move  T\  towards  low  values,  and  hence  the  cut-off  frequency  to 
higher  values  (Figure  3.13). 

In  the  mid  to  lower  mantle,  if  E'  and  Vm  are  assumed  to  be  approximately  constant 
with  depth,  then  the  effect  of  pressure  will  dominate  that  of  temperature  in  equation  (3.7). 
For  values  of  V  greater  than  4cm3  mole ,  increasing  pressure  with  depth  will  move  rt 
towards  higher  values  (F  igure 3.13). This  moves  the  high  frequency  cut-off  of  the  relaxation 
spectrum  through  the  body -wave  band  towards  lower  frequencies.  At  the  base  of  the 
mantle,  the  eliect  of  pressure  may  be  sufficient  to  move  the  high  frequency  cuttoff  of  the 
relaxation  spectrum  to  a  frequency’  value  below  the  body  wave  band.  A  low  Q  zone  in  th* 
body  wave  band  may  still  he  possible  at  the  base  of  the  mantle  if  the  effect  of  pressure 
is  overridden  by  either  a  thermal  boundary  layer  of  the  types  proposed  by  Jeanloz  and 
Richter  (1978)  and  Jarvis  and  Peltier  (1982)  or  by  a  compositional  change  that  changes 
E  and  V  .  The  S  and  ScS  paths  investigated  in  this  study  do  not  show  any  evidence  of 
a  zone  of  increased  attenuation  at  the  base  of  the  mantle  beneath  the  Canadian  Shield 
Rise  times  are  nearlv  identical  for  S  and  ScS  at  all  stations  and  distance  ranges  ( Figure 
3.14).  Rise  times  are  controlled  by  the  phase  velocity  spectrum  at  /~ 0.3  Hz.  Thus  for  these 
paths,  at  least,  there  are  no  measurable  differences  in  the  attenuation  of  0.3  Hz  and  higher 
frequency  S  waves  traversing  the  lower  1000  km  of  the  mantle  versus  those  bottoming 
higher  in  the  mantle 
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Figure  3.13.  Possible  t |  behavior  ab  a  function  of  depth  for  l'^|  pairs.  E‘  and  l 
are  assumed  constant  with  depth.  7\  =  r0exp  .  with  the  T  profile  taken  frori 

Stacey  (1977)  an(i  the  P  profile  taken  from  Anderson  and  Hart  (1977). 
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Figure  3.14.  A  comparison  of  broadband  S  and  ScS  displacement  from  the  deep  earth¬ 
quake  ol  20  April  1984.  At  each  station  the  ScS  displacement  (dashed  line)  is  time  shifted 
and  normalized  to  the  same  amplitude  as  the  S  displacement  (solid  line).  The  S  waves  gen¬ 
erally  have  a  much  longer  tail  than  the  ScS  waves.  The  difference  decreases  with  distance 
from  the  hvpocehter.  The  distances  are  503  for  RS.NT:  66°  for  RSSD-  67°  for  RSON  anr. 
S2°  for  RSCP. 
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At  frequencies  less  than  0.3  Hz,  however,  evidence  was  found  that  may  support  sig¬ 
nificant  differences  in  the  attenuation  of  S  versus  ScS  waves.  A  conspicuous  difference  is 
observed  in  the  decay  of  the  long  period  tail  of  the  displacement  pulses  between  the  S  and 
ScS  phases  (Fig.  3.14).  The  direct  S  seems  to  have  a  much  longer  tail  relative  to  the  ScS 
phases.  This  difference  disappears  as  range  increases  toward  82°.  The  S  waves  at  GRF  for 
this  event  were  of  poor  quality  because  GRF  was  on  an  SH  nodal  line  for  both  S  and  ScS. 
Fortunately,  S  waves  were  available  for  another  earthquake  that  occurred  in  nearly  the 
same  region  and  that  had  nearly  the  same  type  of  source  function.  The  relative  behavior 
of  the  S  versus  ScS  waveforms  from  this  earthquake  is  also  observed  at  the  GR17  array 
(Figure  3.15). 

The  mechanisms  that  may  be  responsible  for  the  differences  in  the  long  period  tail  of  S 
versus  ScS  include  (1)  crustal  reverberations  and  reverberations  of  ScS  in  layered  structure 
at  the  base  of  the  mantle.  (2)  anisotropic  scattering,  (3)  multipathing,  and  (4)  the  depth 
and  frequency  dependence  of  intrinsic  attenuation.  Of  these,  intrinsic  attenuation  appears 
to  be  the  most  likely  explanation.  The  feasibility  of  each  mechanism  is  summarized  as 
follows: 

Crust  and  D  Reverberations.  As  the  angle  of  incidence  on  the  receiver  crust  is  nearly 
identical  for  S  and  ScS  in  the  distance  range  investigated,  the  difference  in  the  long  period 
tail  cannot  be  simply  explained  by  the  effects  of  the  crustal  response.  Other  possible  mech¬ 
anisms  for  generating  arrivals  that  may  interfere  with  S  and  ScS  may  be  multipathing  in 
laterally  heterogeneous  structure  at  the  base  of  the  mantle  (Cormier,  19«5)  or  postcriticai 
reflection  from  a  velocity  discontinuity  near  the  core-mantle  boundary  (Lay  and  Helm- 
berger,  1983).  For  the  distances  of  RSNT  (50°),  RSSD  (667),  and  RSON  (66"),  the  S  wave 
bottoms  far  above  any  such  structure.  Such  structures  could  only  affect  the  ScS  wave 
Our  calculations  show  that  the  reverberations  associated  with  ScS  in  a  layered  D"  region 
would  not  be  sufficient  to  interfere  destructively  with  any  long-period  tail  of  ScS  between 
50  and  66  .  The  pulse  between  S  and  ScS  at  RSGP  (82")  could  possibly  be  explained  a* 
a  postcriticai  reflection  or  a  multipath  arrival  from  some  structure  near  the  core-mantle 
boundary.  Because  these  possibilities  themselves  require  further  confirmation  we  chose  noi 
to  model  the  intermediate  pulse  at  RSCP  lor  the  time  being. 

Am^otropic  Scattering.  Anisotropic  scattering  may  operate  over  the  mid-  and  deep  mantle 
depths  where  the  rav  path  o!  S  most  strongly  differs  from  that  of  .ScS.  In  this  mechanism 
the  spatial  distribution  of  scattering  heterogeneities  at  these  depths  must  be  such  as  :o 
more  strongly  scatter  the  low  frequency  S  waves  incident  at  the  S  wave  angles  than  tho*e 
incident  at  the  ScS  angles.  Chowdury  and  F rasier  ( 1978)  proposed  anisotropic  scattering  a> 
the  mechanism  that  accounted  for  amplitude  variations  in  short  period  P  and  PcP.  whu? 
could  not  be  explained  by  intrinsic  attenuation.  More  recently,  studies  by  Bache  et  ai 
(1985)  and  Taylor  et  al.  (1985)  suggest  that  scattering  becomes  increasingly  important  for 
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Figure  3.15.  A  comparison  of  broadband  S  and  ScS  displacements  from  other  earthquakes  u  *!i 
depths  ranging  from  160  to  573  km.  (Left)  The  ScS  displacement  (dashed  line)  from  staliorf 
RSNT  has  been  time  shifted  and  normalized  to  the  amplitude  of  S  (solid  line).  The  station  is 
50°from  an  event  in  the  Sea  of  Okhotsk  (February  1,  1981,  h  =  573  km).  (Middle)  S  and  ScS 
at  station  A1  of  the  ORF  array  (A  75’)  from  an  event  in  the  Sea  of  Okhotsk  (April  23,  198-1, 
h  =  414  km).  The  ray  path  to  GR  is  nearly  normal  o  the  orientation  of  the  Kuril-Kamachatka 
slab.  (Right)  S  and  ScS  displacements  at  SLR  (A  82°)  from  an  event  in  Oatamarca  Province, 
Argentina  (June  10,  1985,  h  =  160  km).  The  ray  path  is  nearly  normal  to  the  orientation  of  the 
South  American  slab. 
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pulse  attenuation  as  frequency  increases  above  1  Hz.  This  can  account  for  pulse  dispersion 
in  the  reverse  sense  of  that  expected  of  intrinsic  attenuation;  and  for  apparent  P  attenuation 
becoming  more  nearly  equal  to  apparent  S  attenuation.  Our  analysis  of  S  data,  however,  is 
restricted  to  frequencies  less  than  about  2  Hz,  where  pulse  and  rise-time  coherence  suggest 
that  the  effects  of  scattering  do  not  dominate  the  effects  of  intrinsic  attenuation.  In  this 
frequency  hand,  the  P  and  S  waveforms  analyzed  did  not  evidence  any  significant  amount 
of  reverse  dispersion  or  strong  departures  from  tp  =  4.5r„, 

Multipathing.  W.  Chan  and  P.  Silver  (personal  communication)  have  suggested  that  the 
long-period  tail  of  S  may  be  due  to  multipathing  in  the  vicinity  of  the  descending  Kuril- 
Kamchatka  slab.  This  is  likely  to  be  a  frequency  dependent  effect,  in  which  longer  period 
waves  radiated  from  the  source  region  average  over  the  locally  heterogeneous  structure. 
Thus,  it  is  difficult  to  evaluate  this  hypothesis  without  performing  a  synthesis  of  body 
waves  in  a  three  dimensional  structure  by  a  method  capable  of  including  the  effects  of 
diffraction  and  spatial  averaging  over  a  wavelength.  An  observation  that  must  be  satisfied 
is  that  the  long  period  tail  of  S  does  not  appear  to  be  affected  very  much  by  the  take-off 
azimuth  with  respect  to  the  strike  of  the  slab.  The  long  period  tail  is  observed  at  the  GRF 
array(Figure3.i5))for  a  path  that  takes  off  at  an  azimuth  nearly  perpendicular  to  the  strike 
of  the  slab,  and  at  the  RSTN  stations  (Figure 3.14), for  paths  that  take  off  at  azimuths 
sub-parallel  to  the  slab.  The  long-period  tail  of  S  was  also  observed  at  GRF  and  RSTN 
stations  for  events  that  differed  in  depth  by  as  much  as  150  km  and  in  relative  location 
bv  4  .  The  azimuthal  and  depth  independence  of  these  observations  may  prove  difficult  to 
satisfy  with  any  model  of  slab  multipathing. 

Intrinsic  Attenuation.  The  model  of  Qj(r,f)  given  in  Table  3.2  was  designed  to  m?tch  the 
observed  similarities  between  S  and  ScS  as  well  as  their  differences  in  the  long-period  tail. 
The  theoretical  t 3  s  for  66  and  a  depth  of  600  km,  for  example,  are  shown  in  Figure 
3.16.  Note  that  t ,  of  S  differs  from  the  t‘3  of  ScS  in  the  low  frequency  band  but  is  nearlv 
identical  in  the  band  above  0.3  Hz.  The  comparison  of  S  and  ScS  displacements  (Figure 
3.14)  shows  the  design  criteria.  The  rise  times  of  S  and  ScS  are  nearly  indistinguishable. 
Our  modeling  shows  that  the  high  frequency  band  (/  0.3  Hz)  controls  the  rise  times, 

while  frequencies  below  about  0.3  Hz  control  the  decay  of  the  tail.  In  order  to  match  our 
best-fitting  t3  curves,  our  model  of  Qg(t.f)  eliminates  most  attenua’ion  in  the  body  wave 
Wnd  of  seismic  frequencies  in  the  lower  1000  km  of  the  mantle.  It  concentrates  attenuation 
at  mid-mantle  depths  (400  to  2000  km),  primarily  in  the  low  frequency  end  of  the  body 
wave  frequencies.  A  schematic  of  the  earth  using  our  model  of  Qj(r.f)  illustrates  how 
our  bod*  wave  data  are  satisfied  ( Figure  3.17).  ScS  waves  spend  less  time  at  mid-mantie 
depths  compared  to  S  waves,  and  hence  suffer  less  attenuation  at  frequencies  less  than  0  3 
hz.  Both  S  and  ScS  naves,  however,  spend  nearly  equal  times  in  the  upper  mantle,  which 
primarily  controls  the  attenuation  of  frequencies  greater  than  0.3  Hz.  and  hence  the  rise 
times  of  S  and  ^c4'  are  nearlv  identical  The  Q^(r.f)  model  of  Table  3.2,  which  predicts 
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Figure  3.16.  Th^  frequency  dependence  of  t'g{f)  for  S  and  ScS  at  66°  for  a  deep  focu^ 
earthquake  predicted  from  a  Qo(r,f)  model  of  the  type  shown  in  Figure  17. 
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Figure  3.17.  A  schematic  cross-section  of  the  earth  showing  how  the  ray  paths  of  5  and 
ScS  sample  a  mode!. 
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this  effect,  is  consistent  with  a  thermally  activated  model  of  attenuation  with  V  greater 
than  4 cm3/mole,  in  which  the  effect  of  pressure  overrides  that  of  temperature  as  depth 
increases,  and  thereby  moves  fe  through  the  body  wave  band  towards  lower  frequency. 

3.8  CONCLUSIONS 

The  analysis  in  the  time  domain  of  broadband  P  and  S  waves  from  d»ep  focus  earth¬ 
quakes  can  significantly  constrain  the  frequency  dependence  of  the  intrinsic  Q  of  the  earth. 
For  the  paths  investigated  in  this  study,  the  effects  of  intrinsic  attenuation  seem  to  domi¬ 
nate  any  effects  of  scattering  attenuation  on  pulse  shapes  from  low  frequencies  up  to  at  least 
1.0  Hz.  The  gradual  decrease  in  intrinsic  anelasticity  throughout  the  body  wave  band  very 
closely  follows  the  shape  of  t‘a(f)  curves  given  in  Der  et  al.  (1985),  which  they  obtained 
from  a  summary  of  many  spectral  and  time  domain  measurements  of  apparent  attenuation. 
The  gradual  decrease  of  t'  as  a  function  of  frequency  cannot  be  matched  by  any  simple 
parameterization  of  the  frequency  dependence  of  Q,  such  as  a  single  cutoff  frequency  in 
a  relaxation  spectrum  which  is  constant  with  depth.  Although  S  and  ScS  displacement 
pulses  had  nearly  equal  rise  times,  the  pulse  width  of  S  was  substantially  broader  than  the 
pulse  width  of  ScS.  Such  a  feature  can  be  matched  by  a  model  of  C?^(r, /)  in  which  the 
mid-mantle  between  depths  of  400  and  2u00  km  attenuates  primarily  frequencies  less  than 
about  0.3  Hz,  although  frequencies  greater  than  0.3  Hz  are  also  attenuated.  S  and  ScS 
waveforms  from  deep  earthquakes  beneath  the  Sea  of  Okhotsk  observed  at  both  RSTN 
stations  in  North  America  and  at  the  Graefenberg  array  in  Germany  do  not  indicate  the 
existence  of  a  low  Q  zone  at  the  base  of  the  mantle  along  these  paths.  All  waveforms  can 
be  explained  by  a  model  that  has  little  attenuation  throughout  the  body  wave  band  at 
depths  exceeding  2000  km.  Revisions  in  the  high  frequency  cutoff  parameter  in  the  models 
of  proposed  by  Lundquist  and  Cormier  (1980)  and  Anderson  ard  Given  (1980) 

are  all  that  are  required  to  explain  the  pulse  shapes  of  S,  sS  and  ScS  phases  satisfactorily. 
This  suggests  that  the  basic  form  of  Q^(r,f)  is  consistent  with  the  hypothesis  of  these 
papers,  i.e.,  that  Qj  is  due  to  thermally  activated  mechanisms  of  intrinsic  anelasticitv 
with  depth  dependence  controlled  by  the  temperature  and  pressure  profiles  of  the  mantle 
Interpretation  of  the  best-fitting  models  as  a  rate  activated  process  suggests  that 

the  effect  of  temperature  dominates  the  '’fleet  ot  pressure  in  the  upper  mantle,  but  that 
pressure  dominates  temperature  in  the  lower  mantle. 

Broadband  pulse  modeling  combined  with  realistic  source  constraints  offers  a  new 
m^ans  of  determining  the  frequency  dependence  of  attenuation  in  the  earth.  Broadband 
pulses  contain  information  on  the  dispersive  property  of  intrinsic  attenuation  that  can  be 
used  to  retrieve  the  frequency  dependence  of  t‘ .  The  results  reported  here  depend  on  a 
source  analysis  of  P  and  $  waves  and  on  the  existence  of  broadband  digital  recordings  ot 
horizontal  channel  data  This  configuration  of  instruments  has,  until  quite  recenth.  been 
rare.  The  GDSN  dav-tape  has  not  contained  short-period  or  intermediate  period  data  unui 
PSTN  data  were  incorporated  ftito  the  dav-tape.  In  the  future,  greater  global  coverage 


by  broadband  digitally  recording  stations  can  permit  the  mapping  of  the  frequency  and 
regional  dependence  of  attenuation  observed  from  body  waves. 
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